ELITRA.001A PATENT 
GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION IN 

ESCHERICHIA COLI 

RELATED APPLICATIONS 
This application claims priority from U.S. Provisional Patent Application Serial
Number 60/117,405, filed January 27, 1999, the disclosure of which is incorporated
herein by reference in its entirety.
BACKGROUND OF THE INVENTION

Since the discovery of penicillin, the use of antibiotics to treat the ravages of
bacterial infections has saved millions of lives. With the advent of these "miracle drugs,"
for a time it was popularly believed that humanity might, once and for all, be saved from
the scourge of bacterial infections. In fact, during the 1980s and early 1990s, many large
pharmaceutical companies cut back or eliminated antibiotics research and development.
They believed that infectious disease caused by bacteria finally had been conquered and
that markets for new drugs were limited. Unfortunately, this belief was overly optimistic.

The tide is beginning to turn in favor of the bacteria as reports of drug resistant
bacteria become more frequent. The United States Centers for Disease Control announced
that one of the most powerful known antibiotics, vancomycin, was unable to treat an
infection of the common Staphylococcus aureus (staph). This organism is commonly
found in our environment and is responsible for many nosocomial infections. The
import of this announcement becomes clear when one considers that vancomycin was
used for years to treat infections caused by stubborn strains of bacteria, like staph. In
short, the bacteria are becoming resistant to our most powerful antibiotics. If this trend
continues, it is conceivable that we will return to a time when what are presently
considered minor bacterial infections are fatal diseases.

There are a number of causes for the predicament in which practitioners of medical 
arts find themselves. Over-prescription and improper prescription habits by some 
physicians have caused an indiscriminate increase in the availability of antibiotics to the 
public. The patient is also partly responsible, for even in instances where an antibiotic is 
the appropriate treatment, patients will often improperly use the drug, the result being yet
another population of bacteria that is resistant, in whole or in part, to traditional antibiotics.

The bacterial scourges that have haunted humanity remain, in spite of the
development of modern scientific practices to deal with the diseases that they cause. Drug
resistant bacteria are now advancing on the health of humanity. A new generation of
antibiotics to once again deal with the pending health threat that bacteria present is
required.

Discovery of New Antibiotics 
As more and more bacterial strains become resistant to the panel of available
antibiotics, new compounds are required. In the past, practitioners of pharmacology
would have to rely upon traditional methods of drug discovery to generate novel, safe
and efficacious compounds for the treatment of disease. Traditional drug discovery
methods involve blindly testing potential drug candidate molecules, often selected at
random, in the hope that one might prove to be an effective treatment for some disease.
The process is painstaking and laborious, with no guarantee of success. Today, the
average cost to discover and develop a new drug is nearly US $500 million, and the
average time is 15 years from laboratory to patient. Improving this process, even
incrementally, would represent a huge advance in the generation of novel antimicrobial
agents.

Newly emerging practices in drug discovery utilize a number of biochemical
techniques to provide for directed approaches to creating new drugs, rather than
discovering them at random. For example, gene sequences and proteins encoded
thereby that are required for the proliferation of an organism make for excellent targets
since exposure of bacteria to compounds active against these targets would result in the
inactivation of the organism. Once a target is identified, biochemical analysis of that
target can be used to discover or to design molecules that interact with and alter the
functions of the target. Using physical and computational techniques to analyze
structural and biochemical targets in order to derive compounds that interact with a
target is called rational drug design and offers great future potential. Thus, emerging
drug discovery practices use molecular modeling techniques, combinatorial chemistry
approaches, and other means to produce and screen and/or design large numbers of
candidate compounds.

Nevertheless, while this approach to drug discovery is clearly the way of the
future, problems remain. For example, the initial step of identifying molecular targets
for investigation can be an extremely time consuming task. It may also be difficult to
design molecules that interact with the target by using computer modeling techniques.
Furthermore, in cases where the function of the target is not known or is poorly
understood, it may be difficult to design assays to detect molecules that interact with
and alter the functions of the target. To improve the rate of novel drug discovery and
development, methods of identifying important molecular targets in pathogenic
microorganisms and methods for identifying molecules that interact with and alter the
functions of such molecular targets are urgently required.

Escherichia coli represents an excellent model system to understand bacterial
biochemistry and physiology. The estimated 4288 genes scattered along the 4.6 x 10^6
base pairs of the Escherichia coli (E. coli) chromosome offer tremendous promise for
the understanding of bacterial biochemical processes. In turn, this knowledge will assist
in the development of new tools for the diagnosis and treatment of bacteria-caused
human disease. The entire E. coli genome has been sequenced, and this body of
information holds a tremendous potential for application to the discovery and
development of new antibiotic compounds. Yet, in spite of this accomplishment, the
general functions or roles of many of these genes are still unknown. For example, the
total number of proliferation-required genes contained within the E. coli genome is
unknown, but has been variously estimated at around 200 to 700 (Armstrong, K.A. and
Fan, D.P. Essential Genes in the metB-malB Region of Escherichia coli K12, 1975, J.
Bacteriol. 126:48-55).

Novel, safe and effective antimicrobial compounds are needed in view of the
rapid rise of antibiotic resistant microorganisms. However, prior to this invention, the
characterization of even a single bacterial gene was a painstaking process, requiring
years of effort. Accordingly, there is an urgent need for more novel methods to identify
and characterize bacterial genomic sequences that encode gene products required for
proliferation and for methods to identify molecules that interact with and alter the
functions of such genes and gene products.

SUMMARY OF THE INVENTION 


One embodiment of the present invention is a purified or isolated nucleic acid
sequence consisting essentially of one of SEQ ID NOs: 1-81, 405-485, wherein said
nucleic acid inhibits microorganism proliferation. The nucleic acid sequence may be
complementary to at least a portion of a coding sequence of a gene whose expression is
required for microorganism proliferation. The nucleic acid sequence may comprise a
fragment of one of SEQ ID NOs: 1-81, 405-485, said fragment selected from the group
consisting of fragments comprising at least 10, at least 20, at least 25, at least 30, at
least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 1-81, 405-485. The
nucleic acid sequence may be complementary to a coding sequence of a gene whose
expression is required for microorganism proliferation.
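
As an informal illustration of the two notions used in this embodiment (complementarity to a coding sequence, and fragments of at least a given number of consecutive bases), the following Python sketch may be helpful. It is not part of the disclosed methods. The function names are hypothetical, and any sequence passed to them in practice would be the user's own; no sequence from the Sequence Listing is reproduced here.

```python
# Minimal sketch (hypothetical helper names; not taken from the specification).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the antiparallel complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def fragments(seq, min_len):
    """Yield every contiguous fragment of seq that is at least min_len bases long."""
    n = len(seq)
    for length in range(min_len, n + 1):
        for start in range(n - length + 1):
            yield seq[start:start + length]


def shares_complementary_run(candidate, coding, min_len=10):
    """True if candidate contains min_len consecutive bases that are
    complementary (antiparallel) to some stretch of the coding sequence."""
    antisense = reverse_complement(coding.upper())
    cand = candidate.upper()
    return any(
        cand[i:i + min_len] in antisense
        for i in range(len(cand) - min_len + 1)
    )
```

Under these assumptions, a candidate passes the check with min_len=10 when any 10-base window of it is the reverse complement of a stretch of the coding sequence; the other thresholds recited above (20, 25, 30, 50) can be tested by changing min_len.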

Another embodiment of the present invention is a vector comprising a promoter
operably linked to a nucleic acid comprising a sequence selected from the group
consisting of SEQ ID NOs: 1-81, 405-485. The promoter may be active in an organism
selected from the group consisting of Escherichia coli, Staphylococcus aureus,
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus
influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans,
Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella
typhi, Salmonella paratyphi, Salmonella choleraesuis, Staphylococcus epidermidis,
Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus
anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia
trachomatis, Chlamydia pneumoniae or any species falling within the genera of any of
the above species.

Another embodiment of the present invention is a host cell containing the 
vectors described above. 

Another embodiment of the present invention is a purified or isolated nucleic
acid consisting essentially of the coding sequence of one of SEQ ID NOs: 82-88,
90-242. One aspect of this embodiment is a fragment of the nucleic acid comprising at
least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases
of one of SEQ ID NOs: 82-88, 90-242.

Another embodiment of the present invention is a vector comprising a promoter
operably linked to the nucleic acids of the preceding embodiment.

Another aspect of the present invention is a purified or isolated nucleic acid
comprising a nucleic acid sequence complementary to at least a portion of an intragenic
sequence, intergenic sequence, sequences spanning at least a portion of two or more
genes, 5' noncoding region, or 3' noncoding region within an operon encoding a
polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs:
243-357, 359-398.

Another embodiment of the present invention is a purified or isolated nucleic
acid comprising a nucleic acid having at least 70% homology to a sequence selected
from the group consisting of SEQ ID NOs: 1-81, 405-485, 82-88, 90-242 or the
sequences complementary thereto as determined using BLASTN version 2.0 with the
default parameters. The nucleic acid may be from an organism selected from the group
consisting of Staphylococcus aureus, Pseudomonas aeruginosa, Enterobacter cloacae,
Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecalis, Streptococcus
pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces
cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus,
Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi, Salmonella
choleraesuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium
leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis, Clostridium
botulinum, Campylobacter jejuni, Chlamydia trachomatis, Chlamydia pneumoniae
or any species falling within the genera of any of the above species.

Another embodiment of the present invention is a purified or isolated nucleic
acid consisting essentially of a nucleic acid encoding a polypeptide having a sequence
selected from the group consisting of SEQ ID NOs: 243-357, 359-398.

Another embodiment of the present invention is a vector comprising a promoter
operably linked to a nucleic acid encoding a polypeptide having a sequence selected
from the group consisting of SEQ ID NOs: 243-357, 359-398.

Another embodiment of the present invention is a host cell containing the vector
of the preceding embodiment.

Another embodiment of the present invention is a purified or isolated polypeptide
comprising the sequence of one of SEQ ID NOs: 243-357, 359-398.

Another embodiment of the present invention is a purified or isolated polypeptide
comprising a fragment of one of the polypeptides of SEQ ID NOs: 243-357, 359-398,
said fragment selected from the group consisting of fragments comprising at least 5, at
least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60
consecutive amino acids of one of the polypeptides of SEQ ID NOs: 243-357, 359-398.

Another embodiment of the present invention is an antibody capable of 
specifically binding the polypeptide of the preceding embodiment. 
Another embodiment of the present invention is a method of producing a
polypeptide, comprising introducing a vector comprising a promoter operably linked to
a nucleic acid encoding a polypeptide having a sequence selected from the group
consisting of SEQ ID NOs: 243-357, 359-398 into a cell. The method may further
comprise the step of isolating said protein.
Another embodiment of the present invention is a method of inhibiting
proliferation comprising inhibiting the activity or reducing the amount of a polypeptide
having a sequence selected from the group consisting of SEQ ID NOs: 243-357,
359-398 or inhibiting the activity or reducing the amount of a nucleic acid encoding said
polypeptide.

Another embodiment of the present invention is a method for identifying
compounds which influence the activity of a polypeptide required for proliferation
comprising:

contacting a polypeptide comprising a sequence selected from the group
consisting of SEQ ID NOs: 243-357, 359-398 with a candidate compound; and
determining whether said compound influences the activity of said polypeptide.

The activity may be an enzymatic activity. The activity may be a carbon 
compound catabolism activity. The activity may be a biosynthetic activity. The activity 
may be a transporter activity. The activity may be a transcriptional activity. The 
activity may be a DNA replication activity. The activity may be a cell division activity. 
Another embodiment of the present invention is a compound identified using the
above method.

Another embodiment of the present invention is a method for assaying compounds
for the ability to reduce the activity or level of a polypeptide required for proliferation,
comprising:

providing a target, wherein said target comprises the coding sequence of a
sequence selected from the group consisting of SEQ ID NOs: 82-88, 90-242;
contacting said target with a candidate compound; and



measuring an activity of said target. 

The target may be a messenger RNA molecule transcribed from a coding region
of one of SEQ ID NOs: 82-88, 90-242 and said activity is translation of said
messenger RNA. The target may be a coding region of one of SEQ ID NOs: 82-88,
90-242 and said activity is transcription of said messenger RNA.

Another embodiment of the present invention is a compound identified using the 
method above. 

Another embodiment of the present invention is a method for identifying
compounds which reduce the activity or level of a gene product required for cell
proliferation comprising the steps of:

expressing an antisense nucleic acid against a nucleic acid encoding said gene
product in a cell to reduce the activity or amount of said gene product in said cell,
thereby producing a sensitized cell;

contacting said sensitized cell with a compound; and

determining whether said compound inhibits the growth of said sensitized cell to
a greater extent than said compound inhibits the growth of a nonsensitized cell.

The cell may be selected from the group consisting of bacterial cells, fungal
cells, plant cells, and animal cells. The cell may be an E. coli cell. The cell may be
from an organism selected from the group consisting of Staphylococcus aureus,
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus
influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans,
Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella
typhi, Salmonella paratyphi, Salmonella choleraesuis, Staphylococcus epidermidis,
Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus
anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia
trachomatis, Chlamydia pneumoniae or any species falling within the genera of any of
the above species. The antisense nucleic acid may be transcribed from an inducible
promoter. The method may further comprise the step of contacting said cell with a
concentration of inducer which induces said antisense nucleic acid to a sublethal level.
The sublethal concentration of said inducer may be such that growth inhibition is 8%
or more. The inducer may be isopropyl-1-thio-β-D-galactoside. The growth inhibition
may be measured by monitoring optical density of a culture growth solution. The gene
product may be a polypeptide. The gene product may be an RNA. The gene product
may comprise a polypeptide having a sequence selected from the group consisting of
SEQ ID NOs: 243-357, 359-398.
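
The comparison at the heart of this screening method can be illustrated with a short Python sketch. It is offered only as one possible way to express the arithmetic, under the assumption that growth is summarized by a single optical density reading per culture; the function names are hypothetical, and the default of 8% is taken from the growth inhibition figure recited above.

```python
# Minimal sketch (hypothetical names; one optical density value per culture is assumed).

def percent_inhibition(od_treated, od_control):
    """Percent reduction in optical density relative to an untreated control culture."""
    if od_control <= 0:
        raise ValueError("control optical density must be positive")
    return 100.0 * (1.0 - od_treated / od_control)


def is_sensitized(od_induced, od_uninduced, threshold=8.0):
    """True if inducing the antisense nucleic acid inhibits growth by at least
    `threshold` percent (8% or more in the embodiment described above)."""
    return percent_inhibition(od_induced, od_uninduced) >= threshold


def more_active_on_sensitized(sens_with, sens_without, non_with, non_without):
    """True if the compound inhibits the sensitized cell to a greater extent
    than it inhibits the nonsensitized cell.

    Each argument is an optical density: sensitized culture with and without
    compound, then nonsensitized culture with and without compound.
    """
    sensitized = percent_inhibition(sens_with, sens_without)
    nonsensitized = percent_inhibition(non_with, non_without)
    return sensitized > nonsensitized
```

For instance, if a compound lowers the optical density of the sensitized culture from 0.45 to 0.18 (60% inhibition) but lowers that of the nonsensitized culture only from 0.50 to 0.40 (20% inhibition), the sketch reports the compound as more active on the sensitized cell. How large a difference should count as meaningful is not specified here and would depend on the assay.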
Another embodiment of the present invention is a compound identified using the
method above.

Another embodiment of the present invention is a method for inhibiting cellular
proliferation comprising introducing a compound with activity against a gene
corresponding to one of SEQ ID NOs: 82-88, 90-242 or with activity against the
product of said gene into a population of cells expressing a gene. The compound may
be an antisense oligonucleotide comprising a sequence selected from the group
consisting of SEQ ID NOs: 1-81, 405-485, or a proliferation-inhibiting portion thereof.
The proliferation-inhibiting portion of one of SEQ ID NOs: 1-81, 405-485 may be a
fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more
than 50 consecutive bases of one of SEQ ID NOs: 1-81, 405-485. The compound may
be a triple helix oligonucleotide.

Another embodiment of the present invention is a preparation comprising an
effective concentration of an antisense oligonucleotide comprising a sequence selected
from the group consisting of SEQ ID NOs: 1-81, 405-485, or a proliferation-inhibiting
portion thereof in a pharmaceutically acceptable carrier. The proliferation-inhibiting
portion of one of SEQ ID NOs: 1-81, 405-485 may comprise at least 10, at least 20, at
least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID
NOs: 1-81, 405-485.

Another embodiment of the present invention is a method for inhibiting the
expression of a gene in an operon required for proliferation comprising contacting a cell
in a cell population with an antisense nucleic acid, said cell expressing a gene
corresponding to one of SEQ ID NOs: 82-88, 90-242, wherein said antisense nucleic
acid comprises at least a proliferation-inhibiting portion of said operon in an antisense
orientation that is effective in inhibiting expression of said gene. The antisense nucleic
acid may be complementary to a sequence of a gene comprising one or more of SEQ ID
NOs: 82-88, 90-242. The antisense nucleic acid may be a sequence of one of SEQ ID
NOs: 1-81, 405-485, or a portion thereof. The cell may be contacted with said
antisense nucleic acid by introducing a plasmid which expresses said antisense nucleic
acid into said cell population. The cell may be contacted with said antisense nucleic
acid by introducing a phage which expresses said antisense nucleic acid into said cell
population. The cell may be contacted with said antisense nucleic acid by introducing a
sequence encoding said antisense nucleic acid into the chromosome of said cell in
said cell population. The cell may be contacted with said antisense nucleic acid by
introducing a retron which expresses said antisense nucleic acid into said cell
population. The cell may be contacted with said antisense nucleic acid by introducing a
ribozyme into said cell population, wherein a binding portion of said ribozyme is
complementary to said antisense oligonucleotide. The cell may be contacted with said
antisense nucleic acid by introducing a liposome comprising said antisense
oligonucleotide into said cell. The cell may be contacted with said antisense nucleic
acid by electroporation. The antisense nucleic acid may be a fragment comprising at
least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases
of one of SEQ ID NOs: 82-88, 90-242. The antisense nucleic acid may be an
oligonucleotide.

Another embodiment of the present invention is a method for identifying
bacterial strains comprising the steps of:

providing a sample containing a bacterial species; and
identifying a bacterial species using a species specific probe having a sequence
selected from the group consisting of SEQ ID NOs: 1-81, 405-485, 82-88, 90-242.

Another embodiment of the present invention is a method for identifying a gene
in a microorganism required for proliferation comprising:

(a) identifying an inhibitory nucleic acid which inhibits the activity of a gene or
gene product required for proliferation in a first microorganism;

(b) contacting a second microorganism with said inhibitory nucleic acid;

(c) determining whether said inhibitory nucleic acid from said first microorganism
inhibits proliferation of said second microorganism; and

(d) identifying the gene in said second microorganism which is inhibited by said
inhibitory nucleic acid.

Another embodiment of the present invention is a method for assaying a
compound for the ability to inhibit proliferation of a microorganism comprising:

(a) identifying a gene or gene product required for proliferation in a first
microorganism;

(b) identifying a homolog of said gene or gene product in a second microorganism;

(c) identifying an inhibitory nucleic acid sequence which inhibits the activity of said
homolog in said second microorganism;

(d) contacting said second microorganism with a proliferation-inhibiting amount of
said inhibitory nucleic acid, thus sensitizing said second microorganism;

(e) contacting the sensitized microorganism of step (d) with a compound; and

(f) determining whether said compound inhibits proliferation of said sensitized
microorganism to a greater extent than said compound inhibits proliferation of a
nonsensitized microorganism.

The step of identifying a gene involved in proliferation in a first microorganism may
comprise:

introducing a nucleic acid comprising a random genomic fragment from said
first microorganism operably linked to a promoter wherein said random genomic
fragment is in the antisense orientation; and

comparing the proliferation of said first microorganism transcribing a first level
of said random genomic fragment to the proliferation of said first microorganism
transcribing a lower level of said random genomic fragment, wherein a difference in
proliferation indicates that said random genomic fragment comprises a gene involved in
proliferation.

The step of identifying a homolog of said gene in a second microorganism may
comprise identifying a homologous nucleic acid or a nucleic acid encoding a
homologous polypeptide in a database using an algorithm selected from the group
consisting of BLASTN version 2.0 with the default parameters and FASTA version
3.0t78 algorithm with the default parameters. The step of identifying a homolog of said
gene in a second microorganism may comprise identifying a homologous nucleic acid
or a nucleic acid encoding a homologous polypeptide by identifying nucleic acids which
hybridize to said first gene. The step of identifying a homolog of said gene in a second
microorganism may comprise expressing a nucleic acid which inhibits the proliferation
of said first microorganism in said second microorganism. The inhibitory nucleic acid
may be an antisense nucleic acid. The inhibitory nucleic acid may comprise an
antisense nucleic acid to a portion of said homolog. The inhibitory nucleic acid may
comprise an antisense nucleic acid to a portion of the operon encoding said homolog.
The step of contacting the second microorganism with a proliferation-inhibiting amount
of said nucleic acid sequence may comprise directly contacting said second
microorganism with said nucleic acid. The step of contacting the second
microorganism with a proliferation-inhibiting amount of said nucleic acid sequence may
comprise expressing an antisense nucleic acid to said homolog in said second
microorganism.

Another embodiment of the present invention is a compound identified using the
method above.

Another embodiment of the present invention is a method of assaying a
compound for the ability to inhibit proliferation comprising:

(a) identifying an inhibitory nucleic acid sequence which inhibits the activity of a
gene or gene product required for proliferation in a first microorganism;

(b) contacting a second microorganism with a proliferation-inhibiting amount of
said inhibitory nucleic acid, thus sensitizing said second microorganism;

(c) contacting the proliferation-inhibited microorganism of step (b) with a
compound; and

(d) determining whether said compound inhibits proliferation of said sensitized
second microorganism to a greater extent than said compound inhibits proliferation of a
nonsensitized second microorganism.

The inhibitory nucleic acid may be an antisense nucleic acid which inhibits the
proliferation of said first microorganism. The inhibitory nucleic acid may comprise a
portion of an antisense nucleic acid which inhibits the proliferation of said first
microorganism. The inhibitory nucleic acid may comprise an antisense molecule
against the entire coding region of the gene involved in proliferation of the first
microorganism. The inhibitory nucleic acid may comprise an antisense nucleic acid to a
portion of the operon encoding the gene involved in proliferation of the first
microorganism.

Another embodiment of the present invention is a compound identified using the
method above.

Another embodiment of the present invention is a method for assaying
compounds for activity against a biological pathway required for proliferation
comprising:

sensitizing a cell by expressing an antisense nucleic acid against a nucleic acid
encoding a gene product required for proliferation in a cell to reduce the activity or
amount of said gene product;

contacting the sensitized cell with a compound; and

determining whether said compound inhibits the growth of said sensitized cell to
a greater extent than said compound inhibits the growth of a nonsensitized cell.

The cell may be selected from the group consisting of bacterial cells, fungal
cells, plant cells, and animal cells. The cell may be an E. coli cell. The cell may be from an
organism selected from the group consisting of Staphylococcus aureus, Pseudomonas
aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae,
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans,
Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi,
Salmonella choleraesuis, Staphylococcus epidermidis, Mycobacterium tuberculosis,
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis,
Clostridium botulinum, Campylobacter jejuni, Chlamydia trachomatis, Chlamydia
pneumoniae or any species falling within the genera of any of the above species. The
antisense nucleic acid may be transcribed from an inducible promoter. The method may
further comprise contacting the cell with an agent which induces expression of said
antisense nucleic acid from said inducible promoter, wherein said antisense nucleic acid
is expressed at a sublethal level. The sublethal level of said antisense nucleic acid may
inhibit proliferation by 8% or more. The agent may be isopropyl-1-thio-β-D-galactoside
(IPTG). The inhibition of proliferation may be measured by monitoring the
optical density of a liquid culture. The gene product may comprise a polypeptide
having a sequence selected from the group consisting of SEQ ID NOs: 243-357,
359-398.

Another embodiment of the present invention is a compound identified using the
method above.

Another embodiment of the present invention is a method for assaying a
compound for the ability to inhibit cellular proliferation comprising:

contacting a cell with an agent which reduces the activity or level of a gene
product required for proliferation of said cell;
contacting said cell with said compound; and

determining whether said compound reduces proliferation to a greater extent
than said compound reduces proliferation of cells which have not been contacted with
said agent.

The agent which reduces the activity or level of a gene product required for
proliferation of said cell may comprise an antisense nucleic acid to a gene or operon
required for proliferation. The agent which reduces the activity or level of a gene
product required for proliferation of said cell may comprise an antibiotic. The cell may
contain a temperature sensitive mutation which reduces the activity or level of said gene
product required for proliferation of said cell. The antisense nucleic acid may be
directed against the same functional domain of said gene product required for
proliferation of said cell to which said antisense nucleic acid is directed. The antisense
nucleic acid may be directed against a different functional domain of said gene product
required for proliferation of said cell than the functional domain to which said antisense
nucleic acid is directed.

Another embodiment of the present invention is a compound identified using the
method above.

Another embodiment of the present invention is a method for identifying the
pathway in which a proliferation-required nucleic acid or its gene product lies
comprising:

expressing a sublethal level of an antisense nucleic acid directed against said
proliferation-required nucleic acid in a cell;

contacting said cell with an antibiotic, wherein the biological pathway on
which said antibiotic acts is known; and

determining whether said cell has a substantially greater sensitivity to said
antibiotic than a cell which does not express said sublethal level of said antisense
nucleic acid.

Another embodiment of the present invention is a method for determining the
pathway on which a test compound acts comprising:

(a) expressing a sublethal level of an antisense nucleic acid directed against a
proliferation-required nucleic acid in a cell, wherein the biological pathway in which
said proliferation-required nucleic acid lies is known;

(b) contacting said cell with said test compound; and

(c) determining whether said cell has a substantially greater sensitivity to said
test compound than a cell which does not express said sublethal level of said antisense
nucleic acid.

The method may further comprise:

(d) expressing a sublethal level of a second antisense nucleic acid directed against a
second proliferation-required nucleic acid in said cell, wherein said second
proliferation-required nucleic acid is in a different biological pathway than said
proliferation-required nucleic acid in step (a); and

(e) determining whether said cell has a substantially greater sensitivity to said test
compound than a cell which does not express said sublethal level of said second
antisense nucleic acid.
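
The comparison in steps (c) and (e) can be illustrated with a minimal Python sketch. It assumes that sensitivity is summarized as an IC50 for each strain, with and without the sublethal antisense. The function names, the example numbers, and the 2-fold cutoff standing in for "substantially greater sensitivity" are assumptions made for illustration only and are not values taken from this disclosure.

```python
def fold_sensitization(ic50_uninduced, ic50_induced):
    """Fold increase in sensitivity: IC50 without antisense / IC50 with sublethal antisense."""
    if ic50_uninduced <= 0 or ic50_induced <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_uninduced / ic50_induced


def candidate_pathways(ic50_by_pathway, threshold=2.0):
    """Return the pathways whose antisense strain is sensitized at least `threshold`-fold.

    ic50_by_pathway maps a pathway name to (IC50 uninduced, IC50 induced).
    The threshold is an assumed cutoff, not a value from the text.
    """
    hits = []
    for pathway, (uninduced, induced) in ic50_by_pathway.items():
        if fold_sensitization(uninduced, induced) >= threshold:
            hits.append(pathway)
    return sorted(hits)


# Hypothetical measurements in arbitrary concentration units.
example = {
    "protein synthesis": (1.0, 0.2),  # strongly sensitized by the antisense
    "ATP synthesis": (1.0, 0.9),      # essentially unchanged
}
print(candidate_pathways(example))
```

In this sketch a test compound would be assigned to the pathway or pathways whose antisense strains show the largest fold sensitization; strains targeting unrelated pathways serve as the comparison of steps (d) and (e).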

Another embodiment of the present invention is a purified or isolated nucleic
acid consisting essentially of one of SEQ ID NOs: 358, 399-402.

Another embodiment of the present invention is a purified or isolated nucleic
acid comprising a sequence selected from the group consisting of SEQ ID NOs: 1-81,
405-485, 82-88, 90-242, 358, 399-402.

Another embodiment of the present invention is a compound which interacts
with the gene or gene product of a nucleic acid comprising a sequence of one of SEQ
ID NOs: 82-88, 90-242 to inhibit proliferation.

Another embodiment of the present invention is a compound which interacts with a
polypeptide comprising one of SEQ ID NOs: 243-357, 359-398 to inhibit proliferation.

Another embodiment of the present invention is a compound which interacts
with a nucleic acid comprising one of SEQ ID NOs: 358, 399-402 to inhibit
proliferation.

BRIEF DESCRIPTION OF THE DRAWINGS 







Figure 1 is an IPTG dose response curve in E. coli transformed with an
IPTG-inducible plasmid containing either an antisense clone to the E. coli ribosomal
protein rplW (AS-rplW), which is required for protein synthesis and essential for cell
proliferation, or an antisense clone to the elaD (AS-elaD) gene, which is not known to be
involved in protein synthesis and which is also essential for proliferation.

Figure 2A is a tetracycline dose response curve in E. coli transformed
with an IPTG-inducible plasmid containing antisense to rplW (AS-rplW) in the
presence of 0, 20 or 50 µM IPTG.

Figure 2B is a tetracycline dose response curve in E. coli transformed
with an IPTG-inducible plasmid containing antisense to elaD (AS-elaD) in the presence
of 0, 20 or 50 µM IPTG.

Figure 3 is a graph showing the fold increase in tetracycline sensitivity of
E. coli transfected with antisense clones to essential ribosomal proteins L23 (AS-rplW)
and L7/L12 and L10 (AS-rplLrplJ). Antisense clones to genes known not to be
involved in protein synthesis (atpB/E (AS-atpB/E), visC (AS-visC), elaD (AS-elaD),
yohH (AS-yohH)) are much less sensitive to tetracycline.

Definitions 

By "biological pathway" is meant any discrete cell function or process that is
carried out by a gene product or a subset of gene products. Biological pathways include
enzymatic, biochemical and metabolic pathways as well as pathways involved in the
production of cellular structures such as cell walls. Biological pathways that are usually
required for proliferation of microorganisms include, but are not limited to, cell
division, DNA synthesis & replication, RNA synthesis (transcription), protein synthesis
(translation), protein processing, protein transport, fatty acid biosynthesis, cell wall
synthesis, cell membrane synthesis & maintenance, etc.

By "inhibit activity against a gene or gene product" is meant having the ability
to interfere with the function of a gene or gene product in such a way as to decrease
expression of the gene or to reduce the level or activity of a product of the gene. Agents
which have activity against a gene include agents that inhibit transcription of the gene
and agents that inhibit translation of the mRNA transcribed from the gene. In
microorganisms, agents which have activity against a gene can act to decrease
expression of the operon in which the gene resides or alter the processing of operon
RNA such as to reduce the level or activity of the gene product. The gene product can
be a non-translated RNA such as ribosomal RNA, a translated RNA (mRNA) or the
protein product resulting from translation of the gene mRNA. Of particular utility to
the present invention are anti-sense RNAs that have activities against the operons or
genes to which they specifically hybridize.

By "activity against a gene product" is meant having the ability to inhibit the 
function or to reduce the level or activity of the gene product in a cell. 

By "activity against a protein" is meant having the ability to inhibit the function
or to reduce the level or activity of the protein in a cell.

By "activity against nucleic acid" is meant having the ability to inhibit the
function or to reduce the level or activity of the nucleic acid in a cell.

As used herein, "sublethal" means a concentration of an agent below the 
concentration required to inhibit all cell growth. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention describes a group of E. coli genes and gene families required
for growth and/or proliferation. A proliferation-required gene or gene family is one where,
in the absence of a gene transcript and/or gene product, growth or viability of the
microorganism is reduced or eliminated. Thus, as used herein the terminology
"proliferation-required" or "required for proliferation" encompasses sequences where the
absence of a gene transcript and/or gene product completely eliminates cell growth as well
as sequences where the absence of a gene transcript and/or gene product merely reduces
cell growth. These proliferation-required genes can be used as potential targets for the
generation of new antimicrobial agents. To achieve that goal, the present invention also
encompasses novel assays for analyzing proliferation-required genes and for identifying
compounds which interact with the gene products of the proliferation-required genes. In
addition, the present invention contemplates the expression of genes and the purification of
the proteins encoded by the nucleic acid sequences identified as required proliferation
genes and reported herein. The purified proteins can be used to generate reagents and
screen small molecule libraries or other candidate compound libraries for compounds that
can be further developed to yield novel antimicrobial compounds. The present invention
also describes methods for identification of homologous genes in organisms other than E.
coli.

The present invention utilizes a novel method to identify proliferation-required E.
coli sequences. Generally, a library of nucleic acid sequences from a given source is
subcloned or otherwise inserted into an inducible expression vector, thus forming an
expression library. Although the insert nucleic acids may be derived from the
chromosome of the organism into which the expression vector is to be introduced, because
the insert is not in its natural chromosomal location, the insert nucleic acid is an exogenous
nucleic acid for the purposes of the discussion herein. The term expression is defined as
the production of an RNA molecule from a gene, gene fragment, genomic fragment, or
operon. Expression can also be used to refer to the process of peptide or polypeptide
synthesis. An expression vector is defined as a vehicle by which a ribonucleic acid (RNA)
sequence is transcribed from a nucleic acid sequence carried within the expression vehicle.
The expression vector can also contain features that permit translation of a protein product
from the transcribed RNA message expressed from the exogenous nucleic acid sequence
carried by the expression vector. Accordingly, an expression vector can produce an RNA
molecule as its sole product or the expression vector can produce an RNA molecule that is
ultimately translated into a protein product.

Once generated, the expression library containing the exogenous nucleic acid
sequences is introduced into an E. coli population to search for genes that are required for
bacterial proliferation. Because the library molecules are foreign to the population of E.
coli, the expression vectors and the nucleic acid segments contained therein are considered
exogenous nucleic acid.

Expression of the exogenous nucleic acid fragments in the test population of E. coli
containing the expression vector library is then activated. Activation of the expression
vectors consists of subjecting the cells containing the vectors to conditions that result in the
expression of the exogenous nucleic acid sequences carried by the expression vector
library. The test population of E. coli cells is then assayed to determine the effect of
expressing the exogenous nucleic acid fragments on the test population of cells. Those
expression vectors that, upon activation and expression, negatively impact the growth of
the E. coli screen population are identified, isolated, and purified for further study.

A variety of assays are contemplated to identify nucleic acid sequences that
negatively impact growth upon expression. In one embodiment, growth in E. coli cultures
expressing exogenous nucleic acid sequences and growth in cultures not expressing these
sequences is compared. The extent of growth is assayed by measuring optical
densities. Alternatively, enzymatic assays can be used to
measure bacterial growth rates to identify exogenous nucleic acid sequences of interest.
Colony size, colony morphology, and cell morphology are additional factors used to
evaluate growth of the host cells. Those cultures that fail to grow or grow with reduced
efficiency under expression conditions are identified as containing an expression vector
encoding a nucleic acid fragment that negatively affects a proliferation-required gene.
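
The optical density comparison described above can be sketched in a few lines of Python. This is a minimal illustration that assumes one end-point optical density reading per clone under inducing and non-inducing conditions. The clone names, the readings, and the 0.5 cutoff for "reduced efficiency" are hypothetical and are not taken from this disclosure.

```python
def growth_ratio(od_induced, od_uninduced):
    """Ratio of optical density with expression induced to optical density without induction."""
    if od_uninduced <= 0:
        raise ValueError("uninduced culture must show measurable growth")
    return od_induced / od_uninduced


def flag_growth_inhibitory(clones, max_ratio=0.5):
    """Return clone names whose induced growth is at most `max_ratio` of uninduced growth.

    clones maps a clone name to (OD induced, OD uninduced).
    max_ratio is an assumed cutoff chosen for illustration.
    """
    return sorted(
        name
        for name, (induced, uninduced) in clones.items()
        if growth_ratio(induced, uninduced) <= max_ratio
    )


# Hypothetical end-point readings for two library clones.
readings = {
    "clone_A": (0.05, 0.80),  # fails to grow when the insert is expressed
    "clone_B": (0.78, 0.80),  # grows normally
}
print(flag_growth_inhibitory(readings))
```

Clones flagged this way would then go forward to sequencing of the insert.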

Once exogenous nucleic acid sequences of interest are identified, they are
analyzed. The first step of the analysis is to acquire the nucleic acid sequence of the
nucleic acid fragment of interest. To achieve this end, the insert in those expression
vectors identified as containing a sequence of interest is sequenced, using standard
techniques well known in the art. The next step of the process is to determine the source
of the nucleic acid sequence.

Determination of sequence source is achieved by comparing the obtained sequence
data with known sequences in various genetic databases. The sequences identified are
used to probe these gene databases. The result of this procedure is a list of exogenous
nucleic acid sequences corresponding to a list that includes novel bacterial genes required
for proliferation as well as genes previously identified as required for proliferation.

The number of DNA and protein sequences available in database systems has been
growing exponentially for years. For example, at the end of 1998, the complete sequences
of Caenorhabditis elegans, Saccharomyces cerevisiae and nineteen bacterial genomes,
including E. coli, were available. This sequence information is stored in a number of
databanks, such as GenBank (the National Center for Biotechnology Information (NCBI)),
and is publicly available for searching.

A variety of computer programs are available to assist in the analysis of the
sequences stored within these databases. FastA (W. R. Pearson (1990) "Rapid and
Sensitive Sequence Comparison with FASTP and FASTA" Methods in Enzymology
183:63-98) and Sequence Retrieval System (SRS) (Etzold & Argos, SRS an indexing and
retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9:49-57, 1993) are two
examples of computer programs that can be used to analyze sequences of interest. In
one embodiment of the present invention, the BLAST family of computer programs,
which includes BLASTN version 2.0 with the default parameters, or BLASTX version
2.0 with the default parameters, is used to analyze nucleic acid sequences.

BLAST, an acronym for "Basic Local Alignment Search Tool," is a family of
programs for database similarity searching. The BLAST family of programs includes:
BLASTN, a nucleotide sequence database searching program; BLASTX, a protein
database searching program where the input is a nucleic acid sequence; and BLASTP, a
protein database searching program. BLAST programs embody a fast algorithm for
sequence matching, rigorous statistical methods for judging the significance of matches,
and various options for tailoring the program for special situations. Assistance in using the
program can be obtained by e-mail at blast@ncbi.nlm.nih.gov.
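
As a hedged illustration of how search results might be post-processed, the Python sketch below parses the 12-column tab-delimited hit table written by the later NCBI BLAST+ command-line tools (the "-outfmt 6" option, which postdates the BLAST 2.0 programs named above) and keeps the best-scoring hit for each query. The clone and gene names, the example rows, and the E-value cutoff are hypothetical.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    query: str
    subject: str
    percent_identity: float
    alignment_length: int
    evalue: float
    bit_score: float


def parse_tabular_hits(text):
    """Parse 12-column tabular BLAST output into Hit records, skipping blank and comment lines."""
    hits = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 12:
            raise ValueError("expected 12 tab-separated columns: " + line)
        hits.append(
            Hit(fields[0], fields[1], float(fields[2]), int(fields[3]),
                float(fields[10]), float(fields[11]))
        )
    return hits


def best_hit_per_query(hits, max_evalue=1e-5):
    """Keep, for each query, the hit with the highest bit score among hits passing the E-value cutoff."""
    best = {}
    for hit in hits:
        if hit.evalue > max_evalue:
            continue
        current = best.get(hit.query)
        if current is None or hit.bit_score > current.bit_score:
            best[hit.query] = hit
    return best


# Two hypothetical hits for one sequenced insert.
example_table = (
    "clone_1\tgene_x\t98.50\t200\t3\t0\t1\t200\t501\t700\t1e-100\t360\n"
    "clone_1\tgene_y\t80.00\t50\t10\t0\t1\t50\t10\t59\t0.5\t30.1\n"
)
best = best_hit_per_query(parse_tabular_hits(example_table))
print(best["clone_1"].subject)
```

Keeping only the top hit per insert is a simplification; an insert that spans two genes would need all significant hits retained.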

Bacterial genes are often transcribed in polycistronic groups. These groups
comprise operons, which are a collection of genes and intergenic sequences. The genes of
an operon are co-transcribed and are often related functionally. Given the nature of the
screening protocol, it is possible that the identified exogenous nucleic acid sequence
corresponds to a gene or portion thereof with or without adjacent noncoding sequences, an
intragenic sequence (i.e. a sequence within a gene), an intergenic sequence (i.e. a sequence
between genes), a sequence spanning at least a portion of two or more genes, a 5'
noncoding region or a 3' noncoding region located upstream or downstream from the
actual sequence that is required for bacterial proliferation. Accordingly, determining
which of the genes that are encoded within the operons are individually required for
proliferation is often desirable.
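
The categories listed above can be made concrete with a small Python sketch that classifies an insert by its coordinates relative to annotated genes. It is an illustration under stated assumptions: coordinates are inclusive positions on one strand, the annotation is a simple list of (name, start, end) tuples, and the gene names and positions are hypothetical.

```python
def classify_fragment(start, end, genes):
    """Classify a cloned fragment relative to annotated genes.

    genes is a list of (name, gene_start, gene_end) with inclusive coordinates.
    Returns (category, names of overlapped genes).
    """
    if start > end:
        raise ValueError("start must not exceed end")
    overlapped = [
        (name, g_start, g_end)
        for name, g_start, g_end in genes
        if start <= g_end and end >= g_start
    ]
    names = [name for name, _, _ in overlapped]
    if not overlapped:
        return "intergenic", names
    if len(overlapped) >= 2:
        return "spans two or more genes", names
    _, g_start, g_end = overlapped[0]
    if g_start <= start and end <= g_end:
        return "intragenic", names
    return "gene portion with adjacent noncoding sequence", names


# Hypothetical two-gene operon.
operon = [("geneA", 100, 400), ("geneB", 450, 900)]
for fragment in [(150, 300), (410, 440), (350, 500), (50, 200)]:
    print(fragment, classify_fragment(fragment[0], fragment[1], operon))
```

A fragment that falls in any category other than "intragenic" leaves open which gene of the operon is actually required, which is why dissection of the operon is often desirable.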

In one embodiment of the present invention, an operon is dissected to determine
which gene or genes are required for proliferation. For example, the RegulonDB
DataBase described by Huerta et al. (Nucl. Acids Res. 26:55-59, 1998), which may also be
found on the website http://www.cifn.unam.mx/Computational_Biology/regulondb/, may
be used to identify the boundaries of operons encoded within microbial genomes. A
number of techniques that are well known in the art can be used to dissect the operon. In
one aspect of this embodiment, gene disruption by homologous recombination is used to
individually inactivate the genes of an operon that is thought to contain a gene required for
proliferation.

Several gene disruption techniques have been described for the replacement of a
functional gene with a mutated, non-functional (null) allele. These techniques generally
involve the use of homologous recombination. The method described by Link et al. (J.
Bacteriol. 1997 179:6228; incorporated herein by reference in its entirety) serves as an
excellent example of these methods as applicable to disruption of genes in E. coli. This
technique uses crossover PCR to create a null allele with an in-frame deletion of the
coding region of a target gene. The null allele is constructed in such a way that
sequences adjacent to the wild type gene (ca. 500 bp) are retained. These homologous
sequences surrounding the deletion null allele provide targets for homologous
recombination so that the wild type gene on the E. coli chromosome can be replaced by
the constructed null allele.

The crossover PCR amplification product is subcloned into the vector pKO3, the
features of which include a chloramphenicol resistance gene, the counter-selectable
marker sacB, and a temperature sensitive autonomous replication function. Following
transformation of an E. coli cell population with such a vector, selection for cells that
have undergone homologous recombination of the vector into the chromosome is
achieved by growth on chloramphenicol at the non-permissive temperature of 43°C.
Under these conditions, autonomous replication of the plasmid cannot occur and cells are
resistant to chloramphenicol only if the chloramphenicol resistance gene has been
integrated into the chromosome. Usually a single crossover event is responsible for
this integration event such that the E. coli chromosome now contains a tandem
duplication of the target gene consisting of one wild type allele and one deletion null
allele separated by vector sequence.

This new E. coli strain containing the tandem duplication can be maintained at
permissive temperatures in the presence of drug selection (chloramphenicol).
Subsequently, cells of this new strain are cultured at the permissive temperature of 30°C
without drug selection. Under these conditions, the chromosome of some of the cells
within the population will have undergone an internal homologous recombination event
resulting in removal of the plasmid sequences. Subsequent culturing of the strain in
growth medium lacking chloramphenicol but containing sucrose is used to select for
such recombinative resolutions. In the presence of the counter-selectable marker sacB,
sucrose is rendered into a toxic metabolite. Thus, cells that survive this counter-selection
have lost both the plasmid sequences from the chromosome and the
autonomously replicating plasmid that results as a byproduct of recombinative
resolution.

There are two possible outcomes of the above recombinative resolution via
homologous recombination. Either the wild type copy of the targeted gene is retained
on the chromosome or the mutated null allele is retained on the chromosome. In the
case of an essential gene, a single copy of the null allele would be lethal and such cells
should not be obtained by the above procedure when applied to essential genes. In the
case of a non-essential gene, roughly equal numbers of cells containing null alleles and
cells containing wild type alleles should be obtained. Thus, the method serves as a test
for essentiality of the targeted gene: when applied to essential genes, only cells with a
wild type allele on the chromosome will be obtained.
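
The logic of this essentiality test can be sketched in Python. The sketch takes the "roughly equal numbers" expectation from the text as a probability of 0.5 for recovering a null allele from a non-essential gene, and asks how unlikely it would be to see only wild type resolutions by chance. The 0.01 cutoff and the wording of the returned labels are assumptions for illustration.

```python
def probability_all_wild_type(n_colonies, p_null=0.5):
    """Chance that every one of n resolved colonies is wild type if the gene were non-essential."""
    if n_colonies < 0:
        raise ValueError("n_colonies must be non-negative")
    return (1.0 - p_null) ** n_colonies


def interpret_resolution(n_wild_type, n_null, alpha=0.01):
    """Interpret counts of wild type and null-allele colonies recovered after sucrose counter-selection."""
    if n_null > 0:
        return "non-essential (null allele recovered)"
    if n_wild_type == 0:
        return "no colonies scored"
    if probability_all_wild_type(n_wild_type) < alpha:
        return "consistent with an essential gene"
    return "inconclusive (score more colonies)"


print(interpret_resolution(20, 0))
print(interpret_resolution(3, 0))
print(interpret_resolution(10, 9))
```

With only three colonies scored, an all wild type result would arise by chance one time in eight for a non-essential gene, so the sketch reports it as inconclusive.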

Other techniques have also been described for the creation of disruption
mutations in E. coli. For example, Link et al. also describe inserting an in-frame
sequence tag concomitantly with an in-frame deletion in order to simplify analysis of
recombinants obtained. Further, Link et al. describe disruption of genes with a drug
resistance marker such as a kanamycin resistance gene. Arigoni et al. (Arigoni, F. et al.
A Genome-based Approach for the Identification of Essential Bacterial Genes, Nature
Biotechnology 16: 851-856, the disclosure of which is incorporated herein by reference
in its entirety) describe the use of gene disruption combined with engineering a second
copy of a test gene such that the expression of the gene is regulated by an inducible
promoter such as the arabinose promoter to test the essentiality of the gene. Many of
these techniques result in the insertion of large fragments of DNA into the gene of
interest, such as a drug selection marker. An advantage of the technique described by
Link et al. is that it does not rely on an insertion into the gene to cause a functional
defect, but rather results in the precise removal of the coding region. This ensures the
lack of polar effects on the expression of genes downstream from the target gene.

Recombinant DNA techniques can be used to express the entire coding sequences
of the gene identified as required for proliferation, or portions thereof. The over-expressed
proteins can be used as reagents for further study. The identified exogenous sequences are
isolated, purified, and cloned into a suitable expression vector using methods well known
in the art. If desired, the nucleic acids can contain the sequences encoding a signal peptide
to facilitate secretion of the expressed protein.

Expression of fragments of the bacterial genes identified as required for
proliferation is also contemplated by the present invention. The fragments of the identified
genes can encode a polypeptide comprising at least 5, at least 10, at least 15, at least 20, at
least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at
least 65, at least 75, or more than 75 consecutive amino acids of a gene complementary to
one of the identified sequences of the present invention. The nucleic acids inserted into the
expression vectors can also contain sequences upstream and downstream of the coding
sequence.

When expressing the coding sequence of an entire gene identified as required for
bacterial proliferation or a fragment thereof, the nucleic acid sequence to be expressed is
operably linked to a promoter in an expression vector using conventional cloning
technology. The expression vector can be any of the bacterial, insect, yeast, or mammalian
expression systems known in the art. Commercially available vectors and expression
systems are available from a variety of suppliers including Genetics Institute (Cambridge,
MA), Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and Invitrogen
(San Diego, California). If desired, to enhance expression and facilitate proper protein
folding, the codon usage and codon bias of the sequence can be optimized for the
particular expression organism in which the expression vector is introduced, as explained
by Hatfield, et al., U.S. Patent No. 5,082,767, incorporated herein by this reference.
Fusion protein expression systems are also contemplated by the present invention.

Following expression of the protein encoded by the identified exogenous nucleic
acid sequence, the protein is purified. Protein purification techniques are well known in
the art. Proteins encoded and expressed from identified exogenous nucleic acid sequences
can be partially purified using precipitation techniques, such as precipitation with
polyethylene glycol. Chromatographic methods usable with the present invention can
include ion-exchange chromatography, gel filtration, use of hydroxyapatite columns,
immobilized reactive dyes, chromatofocusing, and use of high-performance liquid
chromatography. Electrophoretic methods such as one-dimensional gel electrophoresis,
high-resolution two-dimensional polyacrylamide electrophoresis, isoelectric focusing, and
others are contemplated as purification methods. Also, affinity chromatographic methods,
comprising antibody columns, ligand presenting columns and other affinity
chromatographic matrices are contemplated as purification methods in the present
invention.

The purified proteins produced from the gene coding sequences identified as
required for proliferation can be used in a variety of protocols to generate useful
antimicrobial reagents. In one embodiment of the present invention, antibodies are
generated against the proteins expressed from the identified exogenous nucleic acid
sequences. Both monoclonal and polyclonal antibodies can be generated against the
expressed proteins. Methods for generating monoclonal and polyclonal antibodies are well
known in the art. Also, antibody fragment preparations prepared from the produced
antibodies discussed above are contemplated.

Another application for the purified proteins of the present invention is to screen
small molecule libraries for candidate compounds active against the various target proteins
of the present invention. Advances in the field of combinatorial chemistry provide
methods, well known in the art, to produce large numbers of candidate compounds that can
have a binding, or otherwise inhibitory, effect on a target protein. Accordingly, the
screening of small molecule libraries for compounds with binding affinity or inhibitory
activity for a target protein produced from an identified gene sequence is contemplated by
the present invention.

The present invention further contemplates utility against a variety of other
pathogenic organisms in addition to E. coli. For example, the invention has utility in
identifying genes required for proliferation in prokaryotes and eukaryotes. For example,
the invention has utility with protists, such as Plasmodium spp.; plants; animals, such as
Entamoeba spp. and Contracaecum spp.; and fungi including Candida spp. (e.g.,
Candida albicans), Saccharomyces cerevisiae, Cryptococcus neoformans, and Aspergillus
fumigatus. In one embodiment of the present invention, monera, specifically bacteria, are
probed in search of novel gene sequences required for proliferation. This embodiment is
particularly important given the rise of drug resistant bacteria.

The number of bacterial species that are becoming resistant to existing antibiotics
is growing. A partial list of these organisms includes: Staphylococcus spp., such as S.
aureus; Enterococcus spp., such as E. faecalis; Pseudomonas spp., such as P.
aeruginosa; Clostridium spp., such as C. botulinum; Haemophilus spp., such as H.
influenzae; Enterobacter spp., such as E. cloacae; Vibrio spp., such as V. cholerae;
Moraxella spp., such as M. catarrhalis; Streptococcus spp., such as S. pneumoniae;
Neisseria spp., such as N. gonorrhoeae; Mycoplasma spp., such as Mycoplasma
pneumoniae; Salmonella typhimurium; Helicobacter pylori; Escherichia coli; and
Mycobacterium tuberculosis. The sequences identified as required for proliferation in
the present invention can be used to probe these and other organisms to identify
homologous required proliferation genes contained therein.

In one embodiment of the present invention, the nucleic acid sequences disclosed
herein are used to screen genomic libraries generated from bacterial species of interest
other than E. coli. For example, the genomic library may be from Staphylococcus aureus,
Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus
influenzae, Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans,
Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella
typhi, Salmonella paratyphi, Salmonella choleraesuis, Staphylococcus epidermidis,
Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus
anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia
trachomatis, Chlamydia pneumoniae or any species falling within the genera of any of
the above species. Standard molecular biology techniques are used to generate genomic
libraries from various microorganisms. In one aspect, the libraries are generated and
bound to nitrocellulose paper. The identified exogenous nucleic acid sequences of the
present invention can then be used as probes to screen the libraries for homologous
sequences. The homologous sequences identified can then be used as targets for the
identification of new antimicrobial compounds with activity against more than one
organism.

For example, the preceding methods may be used to isolate nucleic acids having
a sequence with at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at
least 70% identity to a nucleic acid sequence selected from the group consisting of one
of the sequences of SEQ ID NOs: 1-81, 405-485, 82-88, 90-242, fragments comprising
at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive
bases thereof, and the sequences complementary thereto. Identity may be measured
using BLASTN version 2.0 with the default parameters (Altschul, S.F. et al. Gapped
BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs,
Nucleic Acids Res. 25: 3389-3402 (1997), the disclosure of which is incorporated herein
by reference in its entirety). For example, the homologous polynucleotides may have a
coding sequence which is a naturally occurring allelic variant of one of the coding
sequences described herein. Such allelic variants may have a substitution, deletion or
addition of one or more nucleotides when compared to the nucleic acids of SEQ ID
NOs: 1-81, 405-485, 82-88, 90-242 or the sequences complementary thereto.
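
To make the identity thresholds concrete, the Python sketch below computes percent identity over a pair of already aligned sequences and reports the highest of the listed tiers that the pair satisfies. It is a simplified stand-in: identical non-gap columns are divided by the total number of alignment columns, which is an assumed convention and not the BLASTN implementation. The sequences are invented for illustration.

```python
# Identity tiers recited in the paragraph above, highest first.
IDENTITY_TIERS = (97, 95, 90, 85, 80, 70)


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same base in both sequences ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_tier(identity, tiers=IDENTITY_TIERS):
    """Return the largest tier that `identity` meets, or None if it is below every tier."""
    for tier in sorted(tiers, reverse=True):
        if identity >= tier:
            return tier
    return None


# Hypothetical 10-column alignment with a single mismatch.
identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
print(identity, highest_tier(identity))
```

A pair that scores below 70% returns None, meaning it falls outside every tier recited above.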

Additionally, the above procedures may be used to isolate nucleic acids which
encode polypeptides having at least 99%, at least 95%, at least 90%, at least 85%, at least 80%,
at least 70%, at least 60%, at least 50%, or at least 40% identity or similarity to a
polypeptide having the sequence of one of SEQ ID NOs: 243-357, 359-398 or fragments
comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino
acids thereof as determined using the FASTA version 3.0t78 algorithm with the default
parameters. Alternatively, protein identity or similarity may be identified using
BLASTP with the default parameters, BLASTX with the default parameters, or
TBLASTN with the default parameters (Altschul, S.F. et al. Gapped BLAST and PSI-BLAST:
A New Generation of Protein Database Search Programs, Nucleic Acids Res.
25: 3389-3402 (1997), the disclosure of which is incorporated herein by reference in its
entirety).

Alternatively, homologous nucleic acids or polypeptides may be identified by
searching a database to identify sequences having a desired level of homology to a
nucleic acid or polypeptide involved in proliferation or an antisense nucleic acid to a
nucleic acid involved in microbial proliferation. A variety of such databases are
available to those skilled in the art, including GenBank and GenSeq. In some
embodiments, the databases are screened to identify nucleic acids or polypeptides
having at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, at least 70%,
at least 60%, at least 50%, or at least 40% identity or similarity to a nucleic acid or
polypeptide involved in proliferation or an antisense nucleic acid involved in
proliferation. For example, the database may be screened to identify nucleic acids
homologous to one of SEQ ID NOs: 1-81, 405-485, 82-88, 90-242 or polypeptides
homologous to SEQ ID NOs: 243-357, 359-398. In some embodiments, the database
may be screened to identify homologous nucleic acids or polypeptides from organisms
other than E. coli, including organisms such as Staphylococcus aureus, Pseudomonas
aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae,
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans,
Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi,
Salmonella choleraesuis, Staphylococcus epidermidis, Mycobacterium tuberculosis,
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis,
Clostridium botulinum, Campylobacter jejuni, Chlamydia trachomatis, Chlamydia
pneumoniae or any species falling within the genera of any of the above species.

In another embodiment, gene expression arrays and microarrays can be
employed. Gene expression arrays are high density arrays of DNA samples deposited at
specific locations on a glass chip, nylon membrane, or the like. Such arrays can be used
by researchers to quantify relative gene expression under different conditions. Gene
expression arrays are used by researchers to help identify optimal drug targets, profile
new compounds, and determine disease pathways. An example of this technology is
found in U.S. Patent No. 5,807,522, which is hereby incorporated by reference.

It is possible to study the expression of all genes in the genome of a particular
microbial organism using a single array. For example, the arrays from Genosys consist
of 12 x 24 cm nylon filters containing PCR products corresponding to 4290 ORFs from
E. coli. Ten ng of each is spotted every 1.5 mm on the filter. Single stranded labeled
cDNAs are prepared for hybridization to the array (no second strand synthesis or
amplification step is done) and placed in contact with the filter. Thus the labeled
cDNAs are of "antisense" orientation. Quantitative analysis is done by

Hybridization of cDNA made from a sample of total cell mRNA to such an
array followed by detection of binding by one or more of various techniques known to
those in the art results in a signal at each location on the array to which cDNA
hybridized. The intensity of the hybridization signal obtained at each location in the
array thus reflects the amount of mRNA for that specific gene that was present in the
sample. Comparing the results obtained for mRNA isolated from cells grown under
different conditions thus allows for a comparison of the relative amount of expression
of each individual gene during growth under the different conditions.
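
The comparison of signal intensities between two growth conditions can be sketched as a per-gene log2 ratio. The following Python fragment is only an illustration under stated assumptions: the intensities are taken to be background-corrected values keyed by gene name, the `floor` parameter (a hypothetical choice) guards against division by very small signals, and the example numbers are invented.

```python
import math


def log2_expression_ratios(treated: dict, control: dict, floor: float = 1.0) -> dict:
    """Per-gene log2(treated / control) for genes measured in both conditions."""
    ratios = {}
    for gene in treated.keys() & control.keys():
        # Clamp weak signals to the floor so the ratio stays finite.
        t = max(treated[gene], floor)
        c = max(control[gene], floor)
        ratios[gene] = math.log2(t / c)
    return ratios


if __name__ == "__main__":
    # Hypothetical phosphorimager intensities (arbitrary units).
    induced = {"rplL": 100.0, "lepB": 800.0}
    uninduced = {"rplL": 400.0, "lepB": 800.0}
    for gene, ratio in sorted(log2_expression_ratios(induced, uninduced).items()):
        print(gene, ratio)
```

A negative value indicates less signal in the first condition than in the second, and a value near zero indicates no change.
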

Gene expression arrays may be used to analyze the total mRNA expression
pattern at various time points after induction of an antisense nucleic acid against a
proliferation-required gene. Analysis of the expression pattern indicated by
hybridization to the array provides information on whether or not the target gene of the
antisense nucleic acid is being affected by antisense induction, how quickly the
antisense is affecting the target gene, and for later timepoints, what other genes are
affected by antisense expression. For example, if the antisense is directed against a
gene for ribosomal protein L7/L12 in the 50S subunit, its targeted mRNA may
disappear first and then other mRNAs may be observed to increase, decrease or stay the
same. Similarly, if the antisense is directed against a different 50S subunit ribosomal
protein mRNA (e.g. L25), that mRNA may disappear first followed by changes in
mRNA expression that are similar to those seen with the L7/L12 antisense expression.
Thus, the mRNA expression pattern observed with an antisense nucleic acid against a
proliferation-required gene may identify other proliferation-required nucleic acids in the
same pathway as the target of the antisense nucleic acid. In addition, the mRNA
expression patterns observed with candidate drug compounds may be compared to those
observed with antisense nucleic acids against a proliferation-required nucleic acid. If
the mRNA expression pattern observed with the candidate drug compound is similar to
that observed with the antisense nucleic acid, the drug compound may be a promising
therapeutic candidate. Thus, the assay would be useful in assisting in the selection of
candidate drug compounds for use in screening methods such as those described below.
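
One way to quantify whether a compound-induced expression pattern is "similar" to an antisense-induced pattern is a correlation coefficient computed over the genes common to both profiles. The sketch below uses the Pearson correlation as an assumed similarity measure; the specification does not prescribe a particular statistic, and the function names and profile values are hypothetical.

```python
import math


def pearson(xs, ys) -> float:
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return cov / math.sqrt(var_x * var_y)


def profile_similarity(antisense_profile: dict, compound_profile: dict) -> float:
    """Correlate two expression profiles over the genes present in both."""
    genes = sorted(antisense_profile.keys() & compound_profile.keys())
    return pearson(
        [antisense_profile[g] for g in genes],
        [compound_profile[g] for g in genes],
    )


if __name__ == "__main__":
    # Hypothetical log2 expression changes, for illustration only.
    antisense = {"rplL": -2.0, "rplJ": -1.5, "lepB": 0.1, "gor": 0.8}
    compound = {"rplL": -1.8, "rplJ": -1.2, "lepB": 0.0, "gor": 0.9}
    print(round(profile_similarity(antisense, compound), 3))
```

A coefficient close to 1 would suggest that the compound and the antisense nucleic acid perturb expression in a similar way; any cut-off for "similar" would have to be chosen empirically.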

In cases where the source of nucleic acid deposited on the array and the source 
of the nucleic acid being hybridized to the array are from two different organisms, gene 
expression arrays can identify homologous genes in the two organisms. 

The present invention also contemplates additional methods for screening other
microorganisms for proliferation-required genes. In this embodiment, the conserved
portions of sequences identified as proliferation-required can be used to generate
degenerate primers for use in the polymerase chain reaction (PCR). The PCR technique is
well known in the art. The successful production of a PCR product using degenerate
probes generated from the sequences identified herein would indicate the presence of a
homologous gene sequence in the species being screened. This homologous gene is then
isolated, expressed, and used as a target for candidate antibiotic compounds. In another
aspect of this embodiment, the homologous gene is expressed in an autologous organism
or in a heterologous organism in such a way as to alter the level or activity of a
homologous gene required for proliferation in the autologous or heterologous organism. In
still another aspect of this embodiment, the homologous gene or portion is expressed in an
antisense orientation in such a way as to alter the level or activity of a nucleic acid required
for proliferation of an autologous or heterologous organism.
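
A degenerate primer is a shorthand for a pool of related oligonucleotides, written with the standard IUPAC ambiguity codes. The following Python sketch, offered only as an illustration, expands a degenerate primer into its individual sequences and reports its degeneracy; the example primer is hypothetical and is not derived from any sequence of the present application.

```python
from itertools import product
from typing import List

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand_degenerate(primer: str) -> List[str]:
    """List every concrete sequence encoded by a degenerate primer."""
    pools = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*pools)]


if __name__ == "__main__":
    primer = "ATGAARGCNYT"  # hypothetical degenerate primer
    print(degeneracy(primer))
    print(expand_degenerate("ATR"))
```

Because the pool size grows multiplicatively with each ambiguous position, primers are usually designed from the most conserved stretch available so that the degeneracy stays small.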

The homologous sequences to proliferation-required genes identified using the
techniques described herein may be used to identify proliferation-required genes of
organisms other than E. coli, to inhibit the proliferation of organisms other than E. coli
by inhibiting the activity or reducing the amount of the identified homologous nucleic
acid or polypeptide in the organism other than E. coli, or to identify compounds which
inhibit the growth of organisms other than E. coli as described below.

In another embodiment of the present invention, E. coli sequences identified as
required for proliferation are transferred to expression vectors capable of function within
non-E. coli species. As would be appreciated by one of ordinary skill in the art,
expression vectors must contain certain elements that are species specific. These
elements can include promoter sequences, operator sequences, repressor genes, origins
of replication, ribosomal binding sequences, termination sequences, and others. To use
the identified exogenous sequences of the present invention, one of ordinary skill in the
art would know to use standard molecular biology techniques to isolate vectors
containing the sequences of interest from cultured bacterial cells, isolate and purify
those sequences, and subclone those sequences into an expression vector adapted for
use in the species of bacteria to be screened.

Expression vectors for a variety of other species are known in the art. For
example, Cao et al. report the expression of steroid receptor fragments in
Staphylococcus aureus. J. Steroid Biochem Mol Biol. 44(1):1-11 (1993). Also, Pla et
al. have reported an expression vector that is functional in a number of relevant hosts
including: Salmonella typhimurium, Pseudomonas putida, and Pseudomonas
aeruginosa. J. Bacteriol. 172(8):4448-55 (1990). These examples demonstrate the
existence of molecular biology techniques capable of constructing expression vectors
for the species of bacteria of interest to the present invention.

Following the subcloning of the identified nucleic acid sequences into an
expression vector functional in the microorganism of interest, the identified nucleic acid
sequences are conditionally transcribed to assay for bacterial growth inhibition. Those
expression vectors found to contain sequences that, when transcribed, inhibit bacterial
growth are compared to the known genomic sequence of the pathogenic microorganism
being screened or, if the homologous sequence from the organism being screened is not
known, it may be identified and isolated by hybridization to the proliferation-required
E. coli sequence of interest or by amplification using primers based on the
proliferation-required E. coli sequence of interest as described above.

The antisense sequences from the second organism which are identified as
described above may then be operably linked to a promoter, such as an inducible
promoter, and introduced into the second organism. The techniques described herein
for identifying E. coli genes required for proliferation may thus be employed to determine
whether the identified sequences from a second organism inhibit the proliferation of the
second organism.

Antisense nucleic acids required for the proliferation of organisms other than E.
coli or the genes corresponding thereto, may also be hybridized to a microarray
containing the E. coli ORFs to gauge the homology between the E. coli sequences and
the proliferation-required nucleic acids from other organisms. For example, the
proliferation-required nucleic acid may be from Staphylococcus aureus, Pseudomonas
aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae,
Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, Salmonella
typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans,
Aspergillus fumigatus, Klebsiella pneumoniae, Salmonella typhi, Salmonella paratyphi,
Salmonella choleraesuis, Staphylococcus epidermidis, Mycobacterium tuberculosis,
Mycobacterium leprae, Treponema pallidum, Bacillus anthracis, Yersinia pestis,
Clostridium botulinum, Campylobacter jejuni, Chlamydia trachomatis, Chlamydia
pneumoniae or any species falling within the genera of any of the above species. The
proliferation-required nucleic acids from an organism other than E. coli may be
hybridized to the array under a variety of conditions which permit hybridization to
occur when the probe has different levels of homology to the sequence on the
microarray. This would provide an indication of homology across the organisms as
well as clues to other possible essential genes in these organisms.

In still another embodiment, the exogenous nucleic acid sequences of the present
invention that are identified as required for bacterial growth or proliferation can be used as
antisense therapeutics for killing bacteria. The antisense sequences can be directed against
the proliferation-required genes whose sequence corresponds to the exogenous nucleic
acid probes identified here (i.e. the antisense nucleic acid may hybridize to the gene or a
portion thereof). Alternatively, antisense therapeutics can be directed against operons in
which proliferation-required genes reside (i.e. the antisense nucleic acid may hybridize to
any gene in the operon in which the proliferation-required genes reside). Further,
antisense therapeutics can be directed against a proliferation-required gene or portion
thereof with or without adjacent noncoding sequences, an intragenic sequence (i.e. a
sequence within a gene), an intergenic sequence (i.e. a sequence between genes), a
sequence spanning at least a portion of two or more genes, a 5' noncoding region or a 3'
noncoding region located upstream or downstream from the actual sequence that is
required for bacterial proliferation or an operon containing a proliferation-required gene.

In addition to therapeutic applications, the present invention encompasses the use
of nucleic acid sequences complementary to sequences required for proliferation as
diagnostic tools. For example, nucleic acid probes complementary to
proliferation-required sequences that are specific for particular species of microorganisms can be used
as probes to identify particular microorganism species in clinical specimens. This utility
provides a rapid and dependable method by which to identify the causative agent or agents
of a bacterial infection. This utility would provide clinicians the ability to prescribe
species-specific antimicrobial compounds to treat such infections. In an extension of this
utility, antibodies generated against proteins translated from mRNA transcribed from
proliferation-required sequences can also be used to screen for specific microorganisms
that produce such proteins in a species-specific manner.

The following examples teach the genes of the present invention and a subset of
uses for the E. coli genes identified as required for proliferation. These examples are
illustrative only and are not intended to limit the scope of the present invention.

EXAMPLES 

The following examples are directed to the identification and exploitation of E. coli
genes required for proliferation. Methods of gene identification are discussed as well as a
variety of methods to utilize the identified sequences.

Genes Identified as Required for Proliferation of E. coli

Exogenous nucleic acid sequences were cloned into an inducible expression vector
and assayed for growth inhibition activity. Example 1 describes the examination of a
library of exogenous nucleic acid sequences cloned into IPTG-inducible expression
vectors. Upon activation or induction, the expression vectors produced an RNA molecule
corresponding to the subcloned exogenous nucleic acid sequences. The RNA product was
in an antisense orientation with respect to the E. coli genes from which it was originally
derived. This antisense RNA then interacted with sense mRNA produced from various E.
coli genes and interfered with or inhibited the translation of the sense messenger RNA
(mRNA), thus preventing protein production from these sense mRNA molecules. In cases
where the sense mRNA encoded a protein required for proliferation, bacterial cells
containing an activated expression vector failed to grow or grew at a substantially reduced
rate.

EXAMPLE 1 

Inhibition of Bacterial Proliferation after IPTG Induction
To study the effects of transcriptional induction in liquid medium, growth curves
were carried out by back diluting cultures 1:200 into fresh media with or without 1 mM
IPTG and measuring the OD450 every 30 minutes (min). To study the effects of
transcriptional induction on solid medium, a series of seven dilutions of overnight
cultures, each by a power-of-ten factor, was prepared. Aliquots of from 0.5 to 3 µl of these
dilutions were spotted on selective agar plates with or without 1 mM IPTG. After
overnight incubation, the plates were compared to assess the sensitivity of the clones to
IPTG.

Of the numerous clones tested, some clones were identified as containing a
sequence that inhibited E. coli growth after IPTG induction. Accordingly, the gene to
which the inserted nucleic acid sequence corresponds, or a gene within the operon
containing the inserted nucleic acid, may be required for proliferation in E. coli.
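
The liquid-medium comparison in Example 1 can be summarized numerically by fitting an exponential growth rate to the optical density readings taken with and without IPTG. The Python sketch below is an illustration only: it assumes the readings fall within exponential phase, it fits ln(OD) against time by ordinary least squares, and all function names and readings are hypothetical.

```python
import math


def growth_rate(times_min, od_values) -> float:
    """Least-squares slope of ln(OD) versus time, in units of 1/min."""
    n = len(times_min)
    if n != len(od_values) or n < 2:
        raise ValueError("need at least two paired readings")
    logs = [math.log(od) for od in od_values]  # OD readings must be positive
    mean_t = sum(times_min) / n
    mean_l = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    sxy = sum((t - mean_t) * (l - mean_l) for t, l in zip(times_min, logs))
    return sxy / sxx


def doubling_time(times_min, od_values) -> float:
    """Doubling time in minutes implied by the fitted growth rate."""
    return math.log(2) / growth_rate(times_min, od_values)


def relative_growth(times_min, od_induced, od_uninduced) -> float:
    """Growth rate with IPTG divided by growth rate without IPTG."""
    return growth_rate(times_min, od_induced) / growth_rate(times_min, od_uninduced)


if __name__ == "__main__":
    times = [0, 30, 60, 90]  # readings every 30 min, as in Example 1
    without_iptg = [0.05, 0.10, 0.20, 0.40]  # hypothetical OD readings
    with_iptg = [0.05, 0.06, 0.07, 0.08]     # hypothetical OD readings
    print(round(doubling_time(times, without_iptg), 1))
    print(round(relative_growth(times, with_iptg, without_iptg), 2))
```

A relative growth value well below 1 would correspond to a clone that "failed to grow or grew at a substantially reduced rate" on induction; the cut-off for calling a clone inhibitory is not specified here and would need to be set by the experimenter.
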

Characterization of Isolated Clones Negatively Affecting E. coli Proliferation

Following the identification of those expression vectors that, upon expression,
negatively impacted E. coli growth or proliferation, the inserts or nucleic acid fragments
contained in those expression vectors were isolated for subsequent characterization.
Expression vectors of interest were subjected to nucleic acid sequence determination.

EXAMPLE 2 

Nucleic Acid Sequence Determination of Identified Clones Expressing Nucleic Acid
Fragments with Detrimental Effects on E. coli Proliferation
The nucleotide sequences for the exogenous identified sequences were determined
using plasmid DNA isolated using QIAPREP (Qiagen, Valencia, CA) and methods
supplied by the manufacturer. The primers used for sequencing the inserts were 5' -
TGTTTATCAGACCGCTT - 3' (SEQ ID NO: 403) and 5' -
ACAATTTCACACAGCCTC - 3' (SEQ ID NO: 404). These sequences flank the
polylinker in pLEX5BA. Sequence identification numbers (SEQ ID NOs) for the
identified inserts are listed in Table I and discussed below.

EXAMPLE 3 
Comparison of Isolated Sequences to Known Sequences
The nucleic acid sequences of the subcloned fragments obtained from the
expression vectors discussed above were compared to known E. coli sequences in
GenBank using BLAST version 1.4 or version 2.0.6 using the following default
parameters: Filtering off, cost to open a gap=5, cost to extend a gap=2, penalty for a
mismatch in the blast portion of run=-3, reward for a match in the blast portion of run=1,
expectation value (e)=10.0, word size=11, number of one-line descriptions=100, number
of alignments to show (B)=100. BLAST is described in Altschul, J Mol Biol. 215:403-10
(1990), the disclosure of which is incorporated herein by reference in its entirety.
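
The effect of the scoring parameters listed above can be seen by scoring a single pairwise alignment by hand. The sketch below is not BLAST and performs no database search; it merely sums a raw score for an alignment that is already given, using match reward 1, mismatch penalty -3, gap opening cost 5 and gap extension cost 2. It assumes the convention that a gap of length k costs the opening cost plus k times the extension cost, and the function name is hypothetical.

```python
MATCH_REWARD = 1
MISMATCH_PENALTY = -3
GAP_OPEN_COST = 5
GAP_EXTEND_COST = 2


def alignment_score(aligned_query: str, aligned_subject: str) -> int:
    """Raw score of a given pairwise nucleotide alignment ('-' marks a gap)."""
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    score = 0
    gap_in = None  # which sequence currently carries an open gap, if any
    for q, s in zip(aligned_query.upper(), aligned_subject.upper()):
        if q == "-" and s == "-":
            raise ValueError("a column cannot contain two gaps")
        if q == "-" or s == "-":
            current = "query" if q == "-" else "subject"
            if gap_in != current:
                # A new gap starts: charge the opening cost once.
                score -= GAP_OPEN_COST
                gap_in = current
            # Every gapped position is charged the extension cost.
            score -= GAP_EXTEND_COST
        else:
            gap_in = None
            score += MATCH_REWARD if q == s else MISMATCH_PENALTY
    return score


if __name__ == "__main__":
    print(alignment_score("ACGT", "ACGT"))
    print(alignment_score("AC-GT", "ACAGT"))
```

With these values a single mismatch cancels three matches and a one-base gap cancels seven, which is why such settings favor long, nearly identical matches of the kind expected when an insert is compared with its own genome.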

Expression vectors were found to contain nucleic acid sequences in both the sense and
antisense orientations. The presence of known genes, open reading frames, and ribosome
binding sites was determined by comparison to public databases holding genetic
information and various computer programs such as the Genetics Computer Group
programs FRAMES and CODONPREFERENCE. Clones were designated as "antisense"
if the cloned fragment was oriented to the promoter such that the RNA transcript produced
was complementary to the expressed mRNA from a chromosomal locus. Clones were
designated as "sense" if they coded for an RNA fragment that was identical to a portion of
a wild type mRNA from a chromosomal locus.
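
The sense/antisense designation described above amounts to asking whether the transcript of the cloned fragment, or its reverse complement, occurs within the mRNA of the chromosomal locus. A minimal Python sketch follows. It assumes exact matching and sequences written in the DNA alphabet; a real comparison would use an alignment tool that tolerates mismatches. The names and sequences are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def designate(transcript: str, mrna: str) -> str:
    """Classify a cloned fragment's transcript relative to a chromosomal mRNA.

    Both arguments are given 5' to 3' in the DNA alphabet (T in place of U).
    """
    transcript = transcript.upper()
    mrna = mrna.upper()
    if transcript in mrna:
        return "sense"  # identical to a portion of the mRNA
    if reverse_complement(transcript) in mrna:
        return "antisense"  # complementary to a portion of the mRNA
    return "unassigned"


if __name__ == "__main__":
    mrna = "ATGGCTAAAGGTCTG"  # hypothetical mRNA fragment
    print(designate("GCTAAAGG", mrna))
    print(designate(reverse_complement("GCTAAAGG"), mrna))
```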

The sequences described in Examples 1-2 that inhibited bacterial proliferation and
contained gene fragments in an antisense orientation are listed in Table I. This table lists
each identified sequence by: a sequence identification number; a Molecule Number; a gene
to which the identified sequence corresponds, listed according to the National Center for
Biotechnology Information (NCBI), Blattner (Science 277:1453-1474 (1997); also
contains the E. coli K-12 genome sequence), or Rudd (Micro. and Mol. Rev.
62:985-1019 (1998)), (both papers are hereby incorporated by reference) nomenclatures. The
CONTIG numbers for each identified sequence are shown, as well as the location of the first
and last base pairs located on the E. coli chromosome. A Molecule Number with a
** indicates a clone corresponding to an intergenic sequence.

The sequences of the nucleic acid inserts of SEQ ID NOs: 1-81 from U.S.
Provisional Patent Application No. 60/117,405 which inhibited proliferation were further
analyzed. The reanalyzed sequences corresponding to SEQ ID NOs. 1-81 of U.S.
Provisional Patent Application No. 60/117,405 have SEQ ID NOs. 405-485 in the
present application.
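
The renumbering described above pairs each provisional SEQ ID NO from 1 to 81 with a reanalyzed SEQ ID NO from 405 to 485, which is consistent with a constant offset of 404 (Table I pairs them as "1, 405", "18, 422" and so on). A short Python sketch of that correspondence follows; the function name is hypothetical, and the constant offset is an inference from the stated ranges and the Table I pairings.

```python
SEQ_ID_OFFSET = 404  # 405 - 1, inferred from the ranges 1-81 and 405-485


def reanalyzed_seq_id(provisional_seq_id: int) -> int:
    """Map a provisional SEQ ID NO (1-81) to its reanalyzed SEQ ID NO (405-485)."""
    if not 1 <= provisional_seq_id <= 81:
        raise ValueError("only SEQ ID NOs 1-81 were reanalyzed")
    return provisional_seq_id + SEQ_ID_OFFSET


if __name__ == "__main__":
    for seq_id in (1, 18, 81):
        print(seq_id, reanalyzed_seq_id(seq_id))
```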

SEQ ID NOs: 82-242 in U.S. Provisional Patent Application No. 60/117,405 are
identical to SEQ ID NOs: 82-242 of the present application with the following
exceptions. SEQ ID NO: 148 in the present application is the complementary strand of
SEQ ID NO: 148 in U.S. Provisional Patent Application No. 60/117,405. Accordingly,
the protein of SEQ ID NO: 308 which is encoded by SEQ ID NO: 148 has also been
revised. SEQ ID NO: 163 in the present application is the complementary strand of
SEQ ID NO: 163 in U.S. Provisional Patent Application No. 60/117,405. Accordingly,
the protein of SEQ ID NO: 323 which is encoded by SEQ ID NO: 163 has also been
revised.

The target gene of SEQ ID NOs. 18 and 19 of U.S. Provisional Patent
Application No. 60/117,405 (SEQ ID NOs. 18, 19, 422, 423 of the present application)
has been revised from dicF to ftsZ to reflect the fact that these SEQ ID NOs. include
natural antisense molecules which inhibit ftsZ expression.

The gene products of the nucleic acids of SEQ ID NOs. 198 and 239-242 in U.S.
Provisional Patent Application No. 60/117,405 and in the present application (SEQ ID
NOs. 358 and 399-402 of the present application) have been revised to reflect the fact
that these nucleic acids encode nontranslated tRNAs and rRNAs. Tables I and II have
been revised accordingly. The SEQ ID NOs. in Table II were also revised to reflect the
fact that SEQ ID NOs: 89 and 402 were identical in U.S. Provisional Patent Application
No. 60/117,405.

TABLE I 

Identified Clones with Corresponding Genes and Operons 



SEQ ID NO.  Molecule No.            Gene (NCBI)   Gene (Blattner)  Gene (Rudd)   CONTIG
1, 405      EcXA001                 yhhQ          b3471            yhhQ          AE000423
2, 406      EcXA002                 lepB          lepB             lepB          AE000343
3, 407      EcXA003                 f586          b0955            ycbZ          AE000197
4, 408      EcXA004                 rpsG, rpsL    b3341            rpsG, rpsL    AE000410
5, 409      EcXA005a                rplL, rplJ    b3986            rplL, rplJ    AE000472
6, 410      EcXA005b                rplL          rplL             rplL          AE000472
7, 411      EcXA005c                rplL, rplJ    rplL, rplJ       rplL, rplJ    AE000472
8, 412      EcXA005d                rplL, rplJ    rplL, rplJ       rplL, rplJ    AE000472
9, 413      EcXA005e                rplL          rplL             rplL          AE000472
10, 414     EcXA005f                rplL          rplL             rplL          AE000472
11, 415     EcXA005g                rplL          rplL             rplL          AE000472
12, 416     EcXA006                 pta           b2297            pta           AE000319
13, 417     EcXA007                 yicP          b3666            yicP          AE000444
14, 418     EcXA008a                yhaU          b3127            yhaU          AE000394
15, 419     EcXA008b                yhaU          yhaU             yhaU          AE000394
16, 420     EcXA008c                yhaU          yhaU             yhaU          AE000394
17, 421     EcXA009                 ydeY          ydeY             ydeY          AE000249
18, 422     EcXA010a (natural as)   dicF          b1575            dicF          AE000253
19, 423     EcXA010b                dicF          dicF             dicF          AE000253
20, 424     EcXA011                 fdnG          b1474            fdnG          AE000244
21, 425     EcXA012a                fusA          b3340            fusA          AE000410
22, 426     EcXA012b                fusA          fusA             fusA          AE000410
23, 427     EcXA012c                fusA          fusA             fusA          AE000410
24, 428     EcXA013a                o86           b2562            yfhL          AE000342
25, 429     EcXA013b                o86           b2562            yfhL          AE000342
26, 430     EcXA013c                o86           b2562            yfhL          AE000342
27, 431     EcXA014                 visC          b2906            visC          AE000374
28, 432     EcXA015                 yfdI          yfdI             yfdI          AE000323
29, 433     EcXA016                 yeaQ, yoaG    yeaQ, yoaG       yeaQ, yoaG    AE000274

SEQ ID NO.  Molecule No.            Gene (NCBI)   Gene (Blattner)  Gene (Rudd)   CONTIG
30, 434     EcXA017a                yggE          b2922            yggE          AE000375
31, 435     EcXA017b                                               yggE
32, 436     EcXA018a                o464          b2074            yegM          AE000297
33, 437     EcXA018b                o464          b2074            yegM          AE000297
34, 438     EcXA019a                yehA          yehA             yehA          AE000300, AE000299
35, 439     EcXA019b                o172, yehA    o172, yehA       o172, yehA    AE000299
36, 440     EcXA020                 o3H4, f82     b1794, b1795     yeaP, yeaQ    AE000274
37, 441     EcXA021a                f112          b0218            yafU          AE000130
38, 442     EcXA021b                f112          b0218            yafU          AE000130
39, 443     EcXA022                 o740          b1629            ydgN          AE000258
40, 444     EcXA023a                f176, f382    b1504, b1505     ydeS, ydeT    AE000247
41, 445     EcXA023b                f176, f382    b1504, b1505     ydeS, ydeT    AE000247
42, 446     EcXA024                 ygjM, ygjN    b3082            ygjN          AE000390
43, 447     EcXA025                               b1978            yeeJ          AE000289
44, 448     EcXA026                               Unpredicted      Unpredicted   AE000138
4^, 449 


EcXA027a 


1 1 T 

yohti 


17 7 

yohH 


1 7 T 

yohH 


AE000303 


46, 450 


EcXA027b 


yohH 


17 7 

yohH 


17 7 

yohH 


A A A ^ A ^ 

AEOOOjOj 


47, 451 


EcXA027c 


17 7 

yohH 


1 rr 

yohH 


yohH 


A "I~"AAA'^A^ 

AEOOOjOj 






yon I 


yohl 


yohl 




1 O 1 CO 

48. 452 


rr^A.'^ A A'T7 J 


yohH 


yohH 


/, ij 
yohH 


\ CAAA'^A'* 


49, 453 


^ "V A AO O 

EcXA028 


i ") /"l /T 

J296 


b23u:> 


yjel 


\ T~'AAA'^ 1 A 


50, 454 


EcXA029 




b439J 




AE000509 


51, 455 


EcXA()30 


yi5A 


b355'' 


yi5A 


A "T^ f\f\r\ H o 

AE000433 


52, 456 


EcXA03I 


rplE 


B33()8 


rplE 


AE000408 


53, 457 


EcXA032a 


ybgD 


ybgD 


ybgD 


AEOOOl 7:) 


54. 458 


EcXA032b 

if. if. 


ybgD 


ybgD 


ybgD 


AEOOOl 7:) 






gUA 


gllA 


git A 




55, 459 


EcXA033a 


j477 (as) 


b3052 


waat 


A A AAT O n 

Ah000357 












A A A AO O C 

Ab000386 


56, 460 


EcXA033b 


J 47^ 


b3U:)2 


waat 


A T^AAAOO'7 

AE000387 


57, 461 


>^A/ A AT 4 ^ 

EcXA034a 


4 

cspA 




espA 


A T^AAA 1 ^ ^ 


JO, 4o2 


hCAAUj)4D 


cspA 


nJjjn 


espA 


AbUUU4jj 


59, 463 


EcXA035 


yhjU 


yhjU 


yhjU 


AE00043 1 


60, 464 


EcXA036 




bSlOl 


yqj'F 


AE000392 






o99 


bSJOO, 






61,465 


EcXA037 


ydeH 


bl535 


ydeH 


AE000251 

SEQ ID NO.  Molecule No.            Gene (NCBI)   Gene (Blattner)  Gene (Rudd)   CONTIG
62, 466     EcXA038                 sieB          b1353            sieB          AE000233
63, 467     EcXA039                 ybbD                           ybbD          AE000156
64, 468     EcXA040                 insB_6        b3445            insB_6        AE000420
65, 469     EcXA041                 f234          b1138            ymfE          AE000214
66, 470     EcXA042a                rplY          rplY             rplY          AE000308
67, 471     EcXA042b                rplY          rplY             rplY          AE000308
68, 472     EcXA043                 ybgB, cydA    ybgB, cydA       ybgB, cydA    AE000176
69, 473     EcXA044                 purB          b1131            purB          AE000213
70, 474     EcXA045**               csrA, serV    csrA, serV       csrA, serV    AE000353
71, 475     EcXA046**               fimE, fimA    b4313            fimE, fimA    AE000502
72, 476     EcXA047**               f96, cspB     f96, cspB        cspB, ydfS    AE000252
73, 477     EcXA048                               yeJE             yeJE          AE000294
74, 478     EcXA049                 yaiC          b0385            yaiC          AE000145
75, 479     EcXA050                 o467, o222    yaiU, yaiV       yaiV, yaiU    AE000144
76, 480     EcXA051a                rplB, rplW    rplB, rplW       rplB, rplW    AE000408
77, 481     EcXA051b                rplW          rplW             rplW          AE000408
78, 482     EcXA052                 infC          infC             infC          AE000267, AE000266
79, 483     EcXA053                 gor           gor              gor           AE000426
80, 484     EcXA054                 rplF          rplF             rplF          AE000408
81, 485     EcXA055                 rrlG          rrlG             rrlG          AE000345


EXAMPLE 4 

Identification of Genes and their Corresponding Operons Affected by Antisense Inhibition
The sequencing of the entire E. coli genome is described in Blattner et al., Science
277:1453-1474 (1997), the entirety of which is hereby incorporated by reference, and the
sequence of the genome is listed in GenBank Accession No. U00096, the disclosure of
which is incorporated herein by reference in its entirety. The operons to which the
proliferation-inhibiting nucleic acids correspond were identified using RegulonDB and
information in the literature. The coordinates of the boundaries of these operons on the E.
coli genome are listed in Table III. Table II lists the molecule numbers of the inserts
containing the growth inhibiting nucleic acid fragments, the genes in the operons
corresponding to the inserts, the SEQ ID NOs of the genes containing the inserts, the SEQ
ID NOs of the proteins encoded by the genes, the start and stop points of the genes on the
E. coli genome, the orientation of the genes on the genome, whether the operons are
predicted or documented, and the predicted functions of the genes. The identified operons,
their putative functions, and whether or not the genes are presently thought to be required
for proliferation are discussed below.
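
Assigning an insert to an operon by genome coordinates reduces to an interval-overlap test between the insert's first and last base pairs and each operon's left and right boundaries. The Python sketch below illustrates that test; the `Interval` type, the operon names and all coordinates are hypothetical placeholders rather than values from Tables II or III.

```python
from typing import List, NamedTuple


class Interval(NamedTuple):
    name: str
    left: int   # leftmost genome coordinate (inclusive)
    right: int  # rightmost genome coordinate (inclusive)


def overlaps(a: Interval, b: Interval) -> bool:
    """True when two closed genome intervals share at least one base pair."""
    return a.left <= b.right and b.left <= a.right


def operons_hit(insert: Interval, operons: List[Interval]) -> List[str]:
    """Names of all operons whose boundaries overlap the insert."""
    return [operon.name for operon in operons if overlaps(insert, operon)]


if __name__ == "__main__":
    # Hypothetical coordinates, for illustration only.
    operons = [
        Interval("operon_A", 1000, 4000),
        Interval("operon_B", 4500, 9000),
    ]
    insert = Interval("insert_1", 3900, 4600)
    print(operons_hit(insert, operons))
```

An insert that spans an intergenic region can overlap two operons or none, which is one reason the orientation and the predicted or documented status of each operon are recorded alongside the coordinates.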

Functions for the identified genes were determined by using either Blattner
functional class designations or by comparing identified sequence with known sequences
in various databases. A variety of biological functions were noted for the genes to which
the clones of the present invention correspond. The functions for the genes of interest
appear in Table II.

The proteins that are listed in Table II are involved in a wide range of biological 
functions. 

Predicted functional
class of encoded 
proteins 


Translation (rRNA) 


Translation 


Translation 


Translation 


Carbon compound 
catabolism 


Probable adenine 
deaminase 








Blattner functional 
class of encoded 
proteins 


Translation, post- 

translational 

modification


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Carbon compound 
catabolism 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Putative enzymes 


Hypothetical ORF, 

unclassified, 

unknown 


Predicted 
(P) Or 
Documented 
(D) 
Operon 






Q 














Right 
Coordinate 


2729178 


3471815 


4178071 


4178503 


2414911 


3843357 


3269492 


3270407 


3271198 


Left 
Coordinate 


2727636 


3471815 


4177574 


4178138 


2412767 


3841591 


3268266 


oc 
o 
m, 
O^ 


3270428 


Genes On 
Operon 


!^ 


<] 










yhaD 




yhaF


Mole. No. 


EcXA055 




EcXA005a-g 




EcXA006 


EcXA007 


EcXA008a-c 






Gene 
Prod. Seq 
ID No. 


o 


O 
in 
(N 


in 


in 
rJ 


m 
in 


in 


in 
in 


r\ 


in 


GeneSeq
ID 
No. 


ON 
OO 


O 

ON 


ON 


rj 

ON 


m 

ON 


o^. 


in 

ON 


ON 


1^ 

ON 



Predicted functional 
class of encoded 
proteins 


Probable integral 
membrane protein 
Phthalate permease 
family 




Putative ABC 
transporter 










Regulator of cell 
division 


Anaerobic respiration 
(formate dehydro- 


Blattner functional 
class of encoded 
proteins 


Carbon compound 
catabolism 


Putative transport 
proteins 


Putative transport 
proteins 


Hypothetical ORF,

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Cell processes (incl. 

Adaptation, 

protection) 


Energy metabolism


Predicted 
(P) Or 
Documented 
(D) 
Operon 
















oT 




Right 
Coordinate 


3272548 


1601049 


1602071 


1603063 


1604097 


1604999 


1605313 


106456 


1548472 


Left 
Coordinate 


3271214 


1599514 


1601043 


1602071 


1603075 


1604124 


1605023 


105305 


1545425 


Genes On 
Operon 


yhaU






N 






L) 
s: 


N 


fdnG 


Mole. No. 




EcXA009 












EcXAOlOa-b 


EcXAOll 


Gene 
Prod. Seq 
ID No. 


oo 


a> 

to 
ri 


o 




c\ 






VO 
CN 


r-i 


GeneSeq 
ID 
No. 


CO 


as 
as 


o 
o 


o 


n 
o 


en 
O 


o 


O 


o 



Predicted functional 
class of encoded 
proteins 


genase) 








No homologues, no 
motifs 


Ubiquinone synthesis 










Blattner functional 
class of encoded 
proteins 




Energy metabolism 


Energy metabolism 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Biosynthesis of 
cofactors, prosthetic 
groups and carriers 


Translation, post-

translational 

modification 


Hypothetical ORF,

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Predicted 
(P) Or 
Documented 
(D) 
Operon 




















Right 
Coordinate 




1549369 


1550015 


2697943 


3050337 


3051538 


3052860 


3053470 


o 

f~' } 


Left 
Coordinate 




1548485 


1549362 


2697683 


3049135 


3050360 


3051535 


s 3052886 


'O 

oo 
'n 
o 


Genes On 
Operon 




fdnH


fdnI


Same 
operon as 
EcXA004 




.'^ 










Mole. No. 








EcXA012a-c 


EcXA013a-c 


EcXA014








o 

< 


Gene 
Prod. Seq 
ID No. 






oo 

CM 




OS 


o 




rj 


rj 


r4 


GeneSeq 
ID 
No. 




o 


OO 

o 




OS 
O 


o 






m 


-1- 



Predicted functional 
class of encoded 
proteins 




Putative membrane 
protein 


Homologue to 
transgly-cosylase 
associated protein 


No homologues 




Homologues in 
multiple bacteria, no 
motifs 


Transport (multiple 

transferable 

resistance) 






Blattner functional 
class of encoded 
proteins 


Cell structure 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Structural proteins 


Putative transport 
proteins 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Predicted 
(P) Or 
Documented 
(D) 
Operon 




















Right 
Coordinate 


2467154 


2468482 


1877279 


1877609 


1877972 


3066100 


2153285 


2156407 


2159485 


Left 
Coordinate 


2466234 


2467151 


1877031 


1877427 


1877613 


3065360 


2151891 


2153285 


2156408 


Genes On 
Operon 


1 


1 


>^ 


yoaG 












Mole. No. 






EcXA016 






EcXA017a-b 


EcXA018a-b 






Gene 
Prod. Seq 
ID No. 








oo 
rj 


ON 


o 

oo 


oo 


oo 
rj 


m 
oo 


GeneSeq 
ID 
No. 








oo 


ON 


o 


rl 




m 



Predicted functional 
class of encoded 
proteins 




Weak homology to 
pilin precursor from 
H. Inf. 








Same 
operon as 
EcXA016 
(one of 
the two) 


Homologues in H.
Inf. and S. pombe, no
motifs, transmembrane
region present






Blattner functional 
class of encoded 
proteins 


Putative transport 
proteins 


Cell structure 


Hypothetical ORF,

unclassified, 

unknown 


Putative chaperones 


Cell structure 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,

unclassified,

unknown 


Predicted
(P) Or
Documented
(D)
Operon


















Right 
Coordinate 


2160901 


2186434 


2188930 


2189665 


2190242 


239084 


1704372 


1704950 


Left 
Coordinate 


2159486 


2185400 


2186450 


2188^)46 


2189700 


238740 


1703791 


1704372 


Genes On 
Operon 




-i: 




yehC 


Q 








Mole. No. 




EcXA019a-b 








EcXA020 


EcXA021a-b 


EcXA022 




Gene 
Prod. Seq 
ID No.


oo 


oo 


oo 

CJ 


r-^ 
oo 
ri 


oo 
oo 
rj 




ON 

OO 

c\ 


o 




GeneSeq 
ID 
No. 










oo 
rj 




r J 







Predicted functional 
class of encoded 
proteins 














fimf-like 


fimd-like 


Weak homology to 
long chain fatty acid 
CoA ligase in


Blattner functional 
class of encoded 
proteins 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Transcription, RNA 
processing and 
degradation 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Structural proteins 


Hypothetical ORF, 

unclassified, 

unknown 


Predicted 
(P) Or 
Documented 
(D) 
Operon 




















Right 
Coordinate 


1707165 


1708224 


1708848 


1709547 


1710182 


1586320 


1586863 


1588025 


3231785 


Left 
Coordinate 


1704943


1707166


1708228 


1708852 


1709547 


1585817 


C^i 

\C' 
OC' 


1586877 


3231369 


Genes On 
Operon 


1 


ydgO 












ydcT 


% 


Mole. No. 












EcXA023a-b 






r J 
o 
< 
X 


Gene 
Prod. Seq 
ID No. 


ON 


m 

ON 


OS 


ON 


ON 


ON 


oo 


ON 
ON 


o 
o 


GeneSeq 
ID 
No. 


ri 












oo 


0^ 


O 
1- 



Predicted functional 
class of encoded 
proteins 


Archaeoglobus


Homologues in 
various bacteria 


Strong similarity to 
numerous attaching 
and effacing proteins
and invasins 


nifm like 




Xylose binding 
protein-like 




Similar to S. Typhi 
histidine transport 
gene 


Similar to ABC
transporter 


Blattner functional 
class of encoded 
proteins 




Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 




Putative transport 
proteins 


Hypothetical ORF, 
unclassified, 
unknown


Putative regulatory 
protein 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Predicted 
(P) Or 
Documented 
(D) 
Operon 








unpredicted 












Right 
Coordinate 




3232096 


2050036 


331184 


2226539 


2226859 


2228405 


2421559 


4628091 


Left 
Coordinate 




3231782 


2042885 


331001 


2225343 


2226569 


2227458 


2420669 


4626424 


Genes On 
Operon 






yccJ 






yohH 


yohI






Mole. No. 






ri 
O 

s 


EcXA026 


EcXA027a-d 






EcXA028


EcXA029 


Gene 
Prod. Seq 
ID No. 




o 
m 


ri 

o 
m 


o 

ro 


o 
m 


to 
o 

ro 


\o 
o 
m 


o 
m 


oo 
o 


GeneSeq 
ID 
No. 




5 




rn 


t 








oo 



Predicted functional 
class of encoded 
proteins 


IS150 orf A
















Translation 


Blattner functional
class of encoded 
proteins 


Hypothetical ORF, 

unclassified, 

unknown 


Phage, transposon, or 
plasmid 


Translation, post- 

translational 

modification 


Putative transport 
proteins 


Translation, post-

translational

modification


Translation, post-

translational

modification


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Predicted 
(P) Or 
Documented 
(D) 
Operon 






Q 














Right 
Coordinate 


3718830 


3719678 


3440371 


3441734 


3442176 


3442359 


3442866 


3443234 


3443777 


Left 
Coordinate 


3718309 


3718827 


3440255 


3440403 


3441742 


3442180 


3442363 


3442881 


3443244 


Genes On 
Operon 






rpmJ




rplO 


rpmD 


rpsE 


rplR 


rplF


Mole. No. 


EcXA030 




EcXA031 














Gene 
Prod. Seq 
ID No. 


ON 

o 


o 

m 




r4 


m 


m 






CO 


GeneSeq 
ID 
No. 


ON 


o 








to 









Predicted functional 
class of encoded 
proteins 






Translation 






Hypothetical fimbrial 
protein 


Glutamine 
biosynthesis 


ADP heptose 
synthase/ autotrophic 
growth protein 




Blattner functional 
class of encoded 
proteins 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Cell processes (incl.

Adaptation, 

protection) 


Energy metabolism 


Putative enzymes 


Translation, post- 
translational 


Predicted 
(P) Or 
Documented 
(D) 
Operon 














Q 






Right 
Coordinate 


3444182 


3444521 


3445075 


3445404 


3445786 


752018 


753691 


3194394 


3197282 


Left 
Coordinate 


3443790 


3444216 


3444536 


3445090 


3445415 


751452 


752408 


3192961 


3194442 


Genes On 
Operon 


rpsH


rpsN 




rplX 






gltA






Mole. No. 












EcXA032a-b 




EcXA033a-b 




Gene 
Prod. Seq 
ID No. 


OO 


o\ 

rn 


o 

m 


m 


m 


m 
c\ 
m 


^ 

m 


in 
r4 

m 


\o 
m 


GeneSeq 
ID 
No. 


OO 

»n 


O 
in 


o 




c\ 


m 




in 





Predicted functional 
class of encoded 
proteins 






RNA chaperonin 






Regions similar to 
dehydrogenases,
nucleases etc. 








Blattner functional 
class of encoded 
proteins 


modification 


Hypothetical ORF, 

unclassified, 

unknown 


Cell processes (incl. 

Adaptation, 

protection) 


Translation, post-
translational
modification


Hypothetical ORF,
unclassified,
unknown


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,
unclassified, 


Predicted 
(P) Or 
Documented 
(D) 
Operon 








ST 












Right 
Coordinate 




3198606 


3717890 


3695658 


3695846 


3697522 


3246977 


3247320 


3247727 


Left 
Coordinate 




3197305 


3717678 


3694087 


3695658 


3695843 


324()594 


3247015 


r] 
cn 


Genes On 
Operon 




ygiF 


cspA 


yhjS 


yhjT 


:^ 








Mole. No. 






EcXA034a-b 


EcXA035 






< 
X 

W 






Gene 
Prod. Seq 
ID No. 






00 

CN 

m 


ON 

cn 


O 

m 
cn 




rj 
cn 


m 
m 


m 


GeneSeq 
ID 
No. 




NO 


OO 
NO 


ON 


o 






n-i 





Predicted functional 
class of encoded 
proteins 






Homologues in many 
bacteria, blocks; 
secretion/ ATP 
synthase/ftsz 


Similar to carboxy- 
kinase, oxidase, 
symporters 


Super-infection 
exclusion factor B- 
like 








Rhs-like element


Blattner functional 
class of encoded 
proteins 


unknown 


Similar to mukb from 
H. Inf 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Phage, transposon, or 
plasmid 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 
unclassified, 


Predicted 
(P) Or 
Documented 

(D)
Operon 




















Right 
Coordinate 




3248016 


3248594 


1621874 


1417183 


1417368 


526765 


527173 


527883 


Left 
Coordinate 




3247717 


3248112 


1620984 


1416572 


1417192 


522485 


526805 


527173 


Genes On 
Operon 




yqjK 








rajB 
(hI354) 


Q 




ylhH ; 


Mole. No. 








m 
o 

< 
o 


EcXA038 




EcXA039 






Gene 
Prod. Seq 
ID No. 




to 

CO 


m 


m 
m 


OO 

m 


ON 

m 


o 

m 




rsi 


GeneSeq 
ID 
No. 




r- 


<:> 




oo 


ON 


oo 


oo 


r4 
oo 



Predicted functional 
class of encoded 
proteins 




ATP synthase, 
desaturase 












No assigned role 


No assigned role 


Blattner functional 
class of encoded 
proteins 


unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Phage, transposon, or 
plasmid 


Phage, transposon, or 
plasmid 


Hypothetical ORF,

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Hypothetical ORF,

unclassified, 

unknown 


Predicted 
(P) Or 
Documented 

(D) 
Operon 




















Right 
Coordinate 




528124 


528354 


351389 


3581811 


3581085 


3580672 


1196755 


1197460 


Left 
Coordinate 




527864 


528163 


351114 


351308 


3580669 


3579494 


1196090 


1196756


Genes On 
Operon 




ybbD 




1 

•E 


■S 


yrhA 


yhhZ 


1 


1 


Mole. No. 








EcXA040 








o 

< 

X 




Gene 
Prod. Seq 
ID No. 




m 
m 




m 






oo 




o 

r^i 


GeneSeq 
ID 
No. 




m 
oo 


00 


to 

00 


vO 
OO 


oo 


oo 
oo 


a\ 
oo 


O 



to 



Predicted functional 
class of encoded 
proteins 


Translation 




Unknown 


Cytochrome D 
oxidase 




Purine biosynthesis 


Carbon storage 
regulator (mRNA 
decay factor) 


Translation (tRNA) 




Fimbriae


Regulator of 
inversion 




Blattner functional
class of encoded 
proteins 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Carbon compound 
catabolism 


Energy metabolism


Energy metabolism


Nucleotide 
biosynthesis and 
metabolism 


Regulatory function 


Translation, post- 

translational 

modification 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Predicted 
(P) Or 
Documented 
(D) 
Operon 








g 




Q 




Unpredicted 


Q 








Right 
Coordinate 


2280821


767183 


769834 


772249 


773404 


1191209 


2817168 


2816667 


4539127 


4540201 


4541231 


4541835 


Left 
Coordinate 


2280537 


765207 


767201 


770678 


772265 


OO 

c^- 

OO 


2816983

i 


2816575 


4538525 


4539605 


4540683 


4541188 


Genes On 
Operon 


rplY 

1 




yhgB 




cydB 


cq 




:^ 

>u 




^ 
S 


S 


fimI


Mole. No. 


EcXA042a-b 


EcXA043 








< 
X 


EcXA045 




< 

X 
W 








Gene 
Prod. Seq 
ID No. 




in 
m 


m 
in 
m 


in 


in 
in 
m 


\o 

m 


mi 
m 


00 

vn 
m 


o^. 


o 
m 


\o 
m 


\o 
cn 


GeneSeq 
ID 
No. 

.1 




ON 


m 




in 
CN 






oc 




o 


O 


r\ 
o 



in 



Predicted functional 
class of encoded 
proteins 


















Lysis protein 










Blattner functional 
class of encoded 
proteins 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Cell structure 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF,
unclassified, 
unknown 


Cell processes (incl. 

Adaptation, 

protection) 


Phage, transposon, or 
plasmid


Putative enzymes 


Hypothetical ORF, 
unclassified, 


Predicted 
(P) Or 
Documented 
(D) 
Operon 




























Right 
Coordinate 


4542597


4545301 


4545841 


4546357 


4547279 


1638684 


1638081 


1638389 


1638684 


1639578 


2100933 


2101411 


2102531 


Left 
Coordinate 


4541872 


4542665 


4545311 


4545854 


4546377 


1637054 


1637548 


1638078 


1638394 


1639363 


2099917 


2100938 


2101413 


Genes On 
Operon 




Q 


fimF






% 








cspB 


1 






Mole. No. 












EcXA047 










EcXA048 






Gene 
Prod. Seq 
ID No.


VO 
m 




in 

m 






OO 

\o 
cn 


ON 

m 


o 

m 


m 


m 


m 
r- 
cn 


cn 


m 


GeneSeq 
ID 
No. 


m 
O 


o 


tn 
O 
(N 


o 


r-- 
o 
rsi 


oo 
o 

Cn) 


ON 

o 

(N 


O 




r-J 


cn 
r1 




in 



in 



Predicted functional 
class of encoded 
proteins 










UDP-galactopyranose
mutase


Unknown 


Putative auto- 
transporter 


Hypothetical outer 
membrane protein 






Blattner functional 
class of encoded 
proteins 


unknown 


Putative enzymes 


Hypothetical ORF, 

unclassified, 

unknown 


Cell structure 


Hypothetical ORF, 

unclassified, 

unknown 


Hypothetical ORF, 

unclassified, 

unknown 


Putative enzymes 


Hypothetical ORF, 

unclassified,

unknown 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Predicted 
(P) Or 
Documented 
(D) 
Operon 


















Q 




Right 
Coordinate 




2103106 


2104079 


2105248 


2106351 


404042 


r\ 


394353 


3446205 


a- 
\o 


Left 
Coordinate




2102516 


2103087 


2104082 


2105248 


402927 


r\ 
r> 


393685 


3445951 


3446205 


Genes On 
Operon 












yuiC 


yaiU 


yaiV


rpsO 




Mole. No. 












EcXA049 


< 

UJ 




EcXA051a-b 




Gene 
Prod. Seq 
ID No. 






m 


00 


ON 


o 
oo 

m 


CO 
r^i 


c\ 

00 

m 


oo 


00 


GeneSeq 
ID 
No. 








OO 

r4 


r4 


o 
rJ 
r4 


C\ 
C\ 




c\ 
c\ 


CnI 



Predicted functional 
class of encoded 
proteins 










Translation 


Translation 






Blattner functional 
class of encoded 
proteins 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Predicted 
(P) Or 
Documented 
(D) 
Operon 


















Right 
Coordinate 


3446806 


3447520 


3447870 


3448163 


1- 


3449321 


a- 


3450563 


Left 
Coordinate 


3446396 


3446819


3447538 


3447885 


3448180 


3449019 


00 


cr 


Genes On 
Operon 


rplP 


g. 






&^ 








Mole. No.


















Gene 
Prod. Seq 
ID No. 


oo 


OO 


00 

m 


oo 

00 

cn 


oo 
m 


o 

ON 

m 


CO 


rj 
0^ 
r^i 


GeneSeq 
ID 
No. 


iv^ 
(N 
(N 


ri 
ri 




oo 
<N 


o 


O 
rsi 


m 


r\ 



Predicted functional 
class of encoded 
proteins 








Translation




Glutathione oxido- 
reductase 




< 
Z 

.2 

?5 


Blattner functional
class of encoded 
proteins 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Biosynthesis of 
cofactors, prosthetic 
groups and carriers


Translation, post- 

translational 

modification 


Predicted 
(P) Or 
Documented 
(D) 
Operon 




Q 










a 


Right 
Coordinate 


3450907 


1797773 


1798023 


1798662 


1800594 


3645281 


2727204 


Left 
Coordinate 


3450596 


1797417 


1797826 


1798120 


1798666 


3643929 


2724301 


Genes On 
Operon 




rplT 


rpmI






gor 


Same 
operon as 
EcXA031 


'"0 
■-^ 

:^ 


Mole. No. 




EcXA052 








EcXA053 


EcXA054 


EcXA055 


Gene 
Prod. Seq 
ID No. 


ON 

m 


ON 

m 


ON. 


ON 


ON 


cc 

ON 

rn 




ON 
ON 

m 


GeneSeq 
ID 
No. 


m 
m 

CM 




m 






oo 
m 
r J 




m 
r J 



lo 



Predicted functional 
class of encoded 
proteins 


Translation (rRNA) 


Translation (tRNA) 


Translation (rRNA) 


Blattner functional 
class of encoded 
proteins 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Translation, post- 

translational 

modification 


Predicted 
(P) Or 
Documented 

(D) 
Operon 








Right 
Coordinate 


2724208 


2727464 


2729178 


Left 
Coordinate 


2724089 


2727389 


2727636 


Genes On 
Operon 


rrfG 






Mole. No. 








Gene 
Prod. Seq 
ID No. 


o 
o 


o 


r\ 
o 


GeneSeq 
ID 
No. 


o 







in 




Several of the expression vectors contain fragments that correspond to genes of
unknown function or, if the function is known, it is not known whether the gene is
essential. For example, EcXA001, 003, 007, 008, 013, 015, 016, 017, 018, 019, 020, 021,
022, 023, 024, 025, 026, 027, 028, 029, 030, 032, 033, 034, 035, 036, 037, 038, 039, 040,
041, 047, 048, 049 and 050 are all exogenous nucleic acid sequences that correspond to
E. coli proteins that have no known function or whose function has not been shown to
be essential or nonessential.

The present invention reports a number of novel E. coli genes and operons that are
required for proliferation. Each of the clone sequences identified here was found to be
a portion of a gene in an operon required for the proliferation of E. coli.
Cloned sequences corresponding to genes already known to be required for proliferation in
E. coli include EcXA002, 004, 005, 010, 012, 014, 031, 02, 043, 045, 051, 052, 054, and
055. The remaining identified sequences correspond to E. coli genes not previously
designated in the art as required for proliferation.

An interesting observation of the present invention is that there are also several
sequence fragments that correspond to E. coli genes that are not thought to be required for
E. coli proliferation. Nevertheless, under the conditions described above, the antisense
expression of these gene fragments causes a reduction in cell growth. This result implies
that the genes corresponding to the identified sequences are actually required for
proliferation. Molecule Nos. corresponding to these genes are EcXA006, 044, 046, and
053.

Following identification of the sequences of interest, these sequences were
localized into operons. Since bacterial genes are expressed in a polycistronic manner, the
antisense inhibition of a single gene in an operon might affect the expression of all the
other genes on the operon or of the genes downstream from the single gene identified. In
order to determine which of the gene products in an operon are required for proliferation,
each of the genes contained within an operon may be analyzed for its effect on viability
as described below.
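
The polar-effect reasoning above can be sketched in a few lines of Python. This is only an illustration and not part of the described method: the function name `candidate_genes`, the gene names, and the representation of an operon as a list of gene names in transcription order are all assumptions made for the example. Given the gene matched by an antisense clone, the sketch lists that gene together with every gene downstream of it on the same transcript, that is, the genes whose expression might be affected and which would each need to be tested individually.

```python
def candidate_genes(operon, hit):
    """Return the hit gene plus all genes downstream of it.

    `operon` is a list of gene names in transcription order (an assumed,
    simplified representation); `hit` is the gene matched by the clone.
    """
    if hit not in operon:
        raise ValueError(f"{hit!r} is not in the operon")
    return operon[operon.index(hit):]


# Hypothetical four-gene operon, transcribed from geneA to geneD.
example_operon = ["geneA", "geneB", "geneC", "geneD"]
print(candidate_genes(example_operon, "geneB"))
```

A hit in the first gene of an operon therefore leaves every gene of the operon as a candidate, which is why each gene is inactivated in turn.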
EXAMPLE 5

Identification of Individual Genes within an Operon Required for Proliferation
The following example illustrates a method for determining which gene in an
operon is required for proliferation. The clone insert corresponding to Molecule No.
EcXA004 possesses nucleic acid sequence homology to the E. coli genes rpsG and
rpsL. This molecule corresponds to an operon containing two additional genes, fusA and
tufA. The rpsL gene is the first gene in the operon. To determine which gene or genes
in this operon are required for proliferation, each gene is selectively inactivated using
homologous recombination. Gene rpsL is the first gene to be inactivated.

Deletion inactivation of a chromosomal copy of a gene in E. coli can be
accomplished by integrative gene replacement. The principle of this method (Hamilton,
C. M., et al. 1989. J. Bacteriol. 171: 4617-4622) is to construct a mutant allele of the
targeted gene, introduce that allele into the chromosome using a conditional suicide
vector, and then force the removal of the native wild type allele and vector sequences.
This will replace the native gene with a desired mutation(s) but leave promoters,
operators, etc. intact. Essentiality of a gene is determined either by deduction from
genetic analysis or by conditional expression of a wild type copy of the targeted gene
(trans complementation).
The first step is to generate a mutant rpsL allele using PCR amplification. Two
sets of PCR primers are chosen to produce a copy of rpsL with a large central deletion
to inactivate the gene. In order to eliminate polar effects, it is desirable to construct a
mutant allele comprising an in-frame deletion of most or all of the coding region of the
rpsL gene. Each set of PCR primers is chosen such that a region flanking the gene to be
amplified is sufficiently long to allow recombination (typically at least 500 nucleotides
on each side of the deletion). The targeted deletion or mutation will be contained within
this fragment. To facilitate cloning of the PCR product, the PCR primers may also
contain restriction endonuclease sites found in the cloning region of a conditional
knockout vector such as pKO3 (Link, et al. 1997 J. Bacteriol. 179 (20): 6228-6237).
Suitable sites include NotI, SalI, BamHI and SmaI. The rpsL gene fragments are
produced using standard PCR conditions including, but not limited to, those outlined in
the manufacturer's directions for the Hot Start Taq PCR kit (Qiagen, Inc., Valencia,
CA). The PCR reactions will produce two fragments that can be fused together.
Alternatively, crossover PCR can be used to generate a desired deletion in one step (Ho,
S. N., et al. 1989. Gene 77: 51-59; Horton, R. M., et al. 1989. Gene 77: 61-68). The
mutant allele thus produced is called a "null" allele because it cannot produce a
functional gene product.
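
The coordinate arithmetic behind an in-frame deletion with homology flanks can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of the cited references: the function name `design_in_frame_deletion`, the choice to retain six codons at each end of the coding region, the forward-strand orientation, and the example coordinates are all hypothetical. Only the requirement that the deletion be in frame and the guideline of at least 500 nucleotides of flanking sequence come from the text above.

```python
def design_in_frame_deletion(gene_start, gene_end, keep_codons=6, flank=500):
    """Plan an in-frame deletion of a forward-strand gene.

    Coordinates are 1-based and inclusive. `keep_codons` codons are retained
    at each end of the coding region (an illustrative choice); `flank` is the
    length of homologous sequence amplified on each side of the deletion.
    """
    gene_len = gene_end - gene_start + 1
    if gene_len % 3 != 0:
        raise ValueError("coding region length must be a multiple of 3")
    keep = 3 * keep_codons
    if gene_len <= 2 * keep:
        raise ValueError("gene too short for the requested deletion")
    del_start = gene_start + keep   # first deleted nucleotide
    del_end = gene_end - keep       # last deleted nucleotide
    deleted = del_end - del_start + 1
    # Whole codons are removed, so the reading frame is preserved.
    assert deleted % 3 == 0
    return {
        "upstream_fragment": (del_start - flank, del_start - 1),
        "deletion": (del_start, del_end),
        "downstream_fragment": (del_end + 1, del_end + flank),
        "deleted_nt": deleted,
    }


# Hypothetical 375-nt coding region at positions 1001-1375.
plan = design_in_frame_deletion(1001, 1375)
print(plan)
```

The two returned fragments correspond to the two PCR products that are fused to give the null allele; each carries 500 nucleotides of homology for recombination.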

The mutant allele obtained from PCR amplification is cloned into the multiple
cloning site of pKO3. Directional cloning of the rpsL null allele is not necessary. The
pKO3 vector has a temperature-sensitive origin of replication derived from pSC101.
Therefore, clones are propagated at the permissive temperature of 30°C. The vector
also contains two selectable marker genes: one that confers resistance to
chloramphenicol and another, the Bacillus subtilis sacB gene, that allows for counter-
selection on sucrose-containing growth medium. Clones that contain vector DNA with
the null allele inserted are confirmed by restriction endonuclease analysis and DNA
sequence analysis of isolated plasmid DNA. The plasmid containing the rpsL null allele
insert is known as a knockout plasmid.

Once the knockout plasmid has been constructed and its sequence verified, it is
transformed into a Rec+ E. coli host cell. Transformation can be by any standard
method such as electroporation. In some fraction of the transformed cells, plasmids will
integrate into the E. coli chromosome by homologous recombination between the rpsL
null allele in the plasmid and the rpsL gene in the chromosome. Transformant colonies
in which such an event has occurred are readily selected by growth at the non-
permissive temperature of 43°C and in the presence of chloramphenicol. At this
temperature, the plasmid will not replicate as an episome and will be lost from cells as
they grow and divide. These cells are no longer resistant to chloramphenicol and will
not grow when it is present. However, cells in which the knockout plasmid has
integrated into the E. coli chromosome remain resistant to chloramphenicol and
propagate.

Cells containing integrated knockout plasmids are usually the result of a single
crossover event that creates a tandem repeat of the mutant and native wild type alleles
of rpsL separated by the vector sequences. A consequence of this is that rpsL will still
be expressed in these cells. In order to determine if the gene is essential for growth, the
wild type copy must be removed. This is accomplished by selecting for plasmid
excision, a process in which homologous recombination between the two alleles results
in looping out of the plasmid sequences. Cells that have undergone such an excision
event and have lost plasmid sequences, including the sacB gene, are selected for by addition
of sucrose to the medium. The sacB gene product converts sucrose to a toxic molecule.
Thus counter-selection with sucrose ensures that plasmid sequences are no longer
present in the cell. Loss of plasmid sequences is further confirmed by testing for
sensitivity to chloramphenicol (loss of the chloramphenicol resistance gene). The latter
test is important because occasionally a mutation in the sacB gene can occur, resulting in
a loss of sacB function with no effect on plasmid replication (Link, et al. 1997 J.
Bacteriol. 179 (20): 6228-6237). These artifact clones retain plasmid sequences and are
therefore still resistant to chloramphenicol.
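
The selection logic of the preceding paragraphs can be summarized in a small sketch. It is an illustration only: the function names and the reduction of each clone to two yes/no properties (whether plasmid sequences are still present and whether sacB is functional) are simplifying assumptions, not part of the described protocol.

```python
def expected_phenotype(has_plasmid_sequences, sacB_functional):
    """Predict growth phenotypes from the state of the knockout plasmid.

    Simplified model: the chloramphenicol resistance gene travels with the
    plasmid sequences, and sucrose is toxic only while a functional sacB
    gene is present.
    """
    return {
        "chloramphenicol_resistant": has_plasmid_sequences,
        "grows_on_sucrose": not (has_plasmid_sequences and sacB_functional),
    }


def is_true_excisant(grows_on_sucrose, chloramphenicol_resistant):
    """A genuine excision clone grows on sucrose and is chloramphenicol sensitive."""
    return grows_on_sucrose and not chloramphenicol_resistant


# Integrated plasmid, excised plasmid, and the sacB-mutant artifact.
for state in [(True, True), (False, False), (True, False)]:
    pheno = expected_phenotype(*state)
    print(state, pheno, is_true_excisant(pheno["grows_on_sucrose"],
                                         pheno["chloramphenicol_resistant"]))
```

In this model the sacB-mutant artifact grows on sucrose but stays chloramphenicol resistant, which is why the second test is needed to tell it apart from a true excision clone.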

In the process of plasmid excision, one of the two rpsL alleles is lost from the
chromosome along with the plasmid DNA. In general, it is equally likely that the null
allele or the wild type allele will be lost. Therefore, if the rpsL gene is not essential,
half of the clones obtained in this experiment will have the wild type allele on the
chromosome and half will have the null allele. However, if the rpsL gene is essential,
cells containing the null allele will not be obtained, as a single copy of the null allele
would be lethal.
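
The strength of this inference can be estimated with a short calculation. The sketch below assumes, as the paragraph above does, that for a non-essential gene each excision clone independently retains the null allele with probability one half; the function name and the example clone count of 20 are illustrative choices.

```python
def prob_no_null_clones(n_clones, p_null=0.5):
    """Chance of seeing zero null-allele clones among n if the gene is dispensable."""
    if not 0.0 <= p_null <= 1.0:
        raise ValueError("p_null must be between 0 and 1")
    return (1.0 - p_null) ** n_clones


# With 20 independent clones the chance of missing the null allele by
# bad luck alone is (1/2)**20.
print(prob_no_null_clones(20))
```

Under these assumptions, finding only wild type alleles among a few tens of clones is very unlikely unless the null allele is lethal.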

To determine the essentiality of rpsL, a statistically significant number of the
resulting clones, at least 20, are analyzed by PCR amplification of the rpsL gene. Since
the null allele is missing a significant portion of the rpsL gene, its PCR product is
significantly shorter than that of the wild type gene and the two are readily
distinguished by gel electrophoretic analysis. The PCR products may also be subjected
to sequence determination for further confirmation by methods well known to those in
the art.
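
Scoring the clones by amplicon size can be sketched as follows. This is an illustration only: the function name `classify_clone`, the 10% size tolerance, and the example product sizes are assumptions, and real gels would be read against a size ladder rather than exact lengths.

```python
def classify_clone(observed_bp, wild_type_bp, null_bp, tolerance=0.1):
    """Assign a PCR product to the wild type or null allele by its size.

    A product is accepted if it lies within `tolerance` (as a fraction) of
    the expected length; the wild type is checked first, so the two expected
    sizes should differ by more than the tolerance windows.
    """
    for label, expected in (("wild type", wild_type_bp), ("null", null_bp)):
        if abs(observed_bp - expected) <= tolerance * expected:
            return label
    return "ambiguous"


# Hypothetical expected sizes: 1375 bp (wild type) and 1036 bp (null allele).
for size in (1380, 1040, 700):
    print(size, classify_clone(size, 1375, 1036))
```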

The above experiment is generally adequate for determining the essentiality of a
gene such as rpsL. However, it may be necessary or desirable to more directly confirm
the essentiality of the gene. There are several methods by which this can be
accomplished. In general, these involve three steps: 1) construction of an episome
containing a wild type allele, 2) isolation of clones containing a single chromosomal
copy of the mutant null allele as described above but in the presence of the episomal
wild type allele, and then 3) determining if the cells survive when the expression of the
episomal allele is shut off. In this case, the trans copy of wild type rpsL is made by
PCR cloning of the entire coding region of rpsL and inserting it in the sense orientation
downstream of an inducible promoter such as the E. coli lac promoter. Transcription of
this allele of rpsL will be induced in the presence of IPTG, which inactivates the lac
repressor. Under IPTG induction rpsL protein will be expressed as long as the
recombinant gene also possesses a ribosomal binding site, also known as a "Shine-
Dalgarno sequence". The trans copy of rpsL is cloned on a plasmid that is compatible
with pSC101. Compatible vectors include p15A, pBR322, and the pUC plasmids,
among others. Replication of the compatible plasmid will not be temperature-sensitive.
The entire process of integrating the null allele of rpsL and subsequent plasmid excision
is carried out in the presence of IPTG to ensure the expression of functional rpsL
protein is maintained throughout. After the null rpsL allele is confirmed as integrated
on the chromosome in place of the wild type rpsL allele, IPTG is withdrawn and
expression of functional rpsL protein is shut off. If the rpsL gene is essential, cells will
cease to proliferate under these conditions. However, if the rpsL gene is not essential,
cells will continue to proliferate under these conditions. In this experiment, essentiality
is determined by conditional expression of a wild type copy of the gene rather than
inability to obtain the intended chromosomal disruption.

An advantage of this method over some other gene disruption techniques is that
the targeted gene can be deleted or mutated without the introduction of large segments
of foreign DNA. Therefore, polar effects on downstream genes are eliminated or
minimized. There are methods described to introduce inducible promoters upstream of
potential essential bacterial genes. However, in such cases, polarity from multiple
transcription start points can be a problem. One way of preventing this is to insert a
gene disruption cassette that contains strong transcriptional terminators upstream of the
integrated inducible promoter (Zhang, Y., and Cronan, J. E. 1996 J. Bacteriol. 178 (12):
3614-3620). The described techniques will all be familiar to one of ordinary skill in the
art.

Following the analysis of the rpsL gene, the other genes of the operon are
investigated to determine if they are required for proliferation.

EXAMPLE 6

Expression of the Proteins Encoded by Genes Identified as Required for E. coli 

Proliferation 

The following is provided as one exemplary method to express the proliferation-
required proteins encoded by the identified sequences described above. First, the initiation
and termination codons for the gene are identified. If desired, methods well known in the
art can be used to improve translation or expression of the protein. For example, if the
nucleic acid encoding the polypeptide to be expressed lacks a methionine codon to serve as
the initiation site, a strong Shine-Dalgarno sequence, or a stop codon, these sequences can
be added. Similarly, if the identified nucleic acid sequence lacks a transcription
termination signal, this sequence can be added to the construct by, for example, splicing
out such a sequence from an appropriate donor sequence. In addition, the coding sequence
may be operably linked to a strong promoter or an inducible promoter if desired. The
identified nucleic acid sequence or portion thereof encoding the polypeptide to be
expressed is obtained by PCR from the bacterial expression vector or genome using
oligonucleotide primers complementary to the identified nucleic acid sequence or portion
thereof and containing restriction endonuclease sequences for NcoI incorporated into the 5'
primer and BglII at the 5' end of the corresponding 3' primer, taking care to ensure that the
identified nucleic acid sequence is positioned in frame with the termination signal. The
purified fragment obtained from the resulting PCR reaction is digested with NcoI and
BglII, purified and ligated to an expression vector.
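
The primer construction described above can be sketched in Python. This is a hedged illustration, not a validated primer-design tool: the function names, the 18-nucleotide annealing length, the two-base 5' clamp and the example template are all assumptions. The recognition sequences used are CCATGG for NcoI and AGATCT for BglII. The sketch ignores melting temperature, internal restriction sites, and the reading-frame check that the text calls for.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

NCOI_SITE = "CCATGG"   # NcoI recognition sequence
BGLII_SITE = "AGATCT"  # BglII recognition sequence


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(coding_seq, anneal_len=18, clamp="GC"):
    """Build 5' and 3' primers carrying NcoI and BglII sites at their 5' ends.

    `clamp` is a short run of extra bases placed 5' of each site (an assumed
    convenience to help the enzymes cut near the end of the PCR product).
    """
    if len(coding_seq) < anneal_len:
        raise ValueError("template shorter than the annealing length")
    forward = clamp + NCOI_SITE + coding_seq[:anneal_len]
    reverse = clamp + BGLII_SITE + reverse_complement(coding_seq[-anneal_len:])
    return forward, reverse


# Hypothetical 39-nt open reading frame used only to exercise the function.
template = "ATGGCAACCGTTAACCAGCTGGTACGCAAACCACGTTAA"
fwd, rev = design_primers(template)
print(fwd)
print(rev)
```

Because the NcoI site itself contains an ATG, a real design would usually overlap the site with the initiation codon instead of simply prepending it as done here.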

The ligated product is transformed into DH5α or some other E. coli strain suitable
for the overexpression of potential proteins. Transformation protocols are well known in
the art. For example, transformation protocols are described in: Current Protocols in
Molecular Biology, Vol. 1, Unit 1.8, (Ausubel, et al., Eds.) John Wiley & Sons, Inc.
(1997). Positive transformants are selected after growing the transformed cells on plates
containing 50-100 µg/ml ampicillin (Sigma, St. Louis, Missouri). In one embodiment, the
expressed protein is held in the cytoplasm of the host organism. In an alternate
embodiment, the expressed protein is released into the culture medium. In still another
alternative, the expressed protein can be sequestered in the periplasmic space and liberated
therefrom using any one of a number of cell lysis techniques known in the art. One
example is the osmotic shock cell lysis method described in Chapter 16 of Current
Protocols in Molecular Biology, Vol. 2, (Ausubel, et al., Eds.) John Wiley & Sons, Inc.
(1997). Each of these procedures can be used to express a proliferation-required protein.

Expressed proteins, whether in the culture medium or liberated from the
periplasmic space or the cytoplasm, are then purified or enriched from the supernatant
using conventional techniques such as ammonium sulfate precipitation, standard
chromatography, immunoprecipitation, immunochromatography, size exclusion
chromatography, ion exchange chromatography, and HPLC. Alternatively, the secreted
protein can be in a sufficiently enriched or pure state in the supernatant or growth media of
the host to permit it to be used for its intended purpose without further enrichment. The
purity of the protein product obtained can be assessed using techniques such as Coomassie
or silver staining or using antibodies against the control protein. Coomassie and silver
staining techniques are familiar to those skilled in the art.

Antibodies capable of specifically recognizing the protein of interest can be
generated using synthetic peptides using methods well known in the art. See, Antibodies:
A Laboratory Manual, (Harlow and Lane, Eds.) Cold Spring Harbor Laboratory (1988).
For example, 15-mer peptides having a sequence encoded by the appropriate identified
gene sequence of interest or portion thereof can be chemically synthesized. The synthetic
peptides are injected into mice to generate antibodies to the polypeptide encoded by the
identified nucleic acid sequence of interest or portion thereof. Alternatively, samples of
the protein expressed from the expression vectors discussed above can be purified and
subjected to amino acid sequencing analysis to confirm the identity of the recombinantly
expressed protein and subsequently used to raise antibodies. The generation of
monoclonal and polyclonal antibodies is described in detail in Example 7.







The protein encoded by the identified nucleic acid sequence of interest or portion
thereof can be purified using standard immunochromatography techniques. In such
procedures, a solution containing the secreted protein, such as the culture medium or a cell
extract, is applied to a column having antibodies against the secreted protein attached to
the chromatography matrix. The secreted protein is allowed to bind the
immunochromatography column. Thereafter, the column is washed to remove
non-specifically bound proteins. The specifically bound secreted protein is then released from
the column and recovered using standard techniques. These procedures are well known in
the art.

In an alternative protein purification scheme, the identified nucleic acid sequence
of interest or portion thereof can be incorporated into expression vectors designed for use
in purification schemes employing chimeric polypeptides. In such strategies the coding
sequence of the identified nucleic acid sequence of interest or portion thereof is inserted
in-frame with the gene encoding the other half of the chimera. The other half of the chimera
can be maltose binding protein (MBP) or a nickel binding polypeptide encoding sequence.
A chromatography matrix having antibody to MBP or nickel attached thereto is then used
to purify the chimeric protein. Protease cleavage sites can be engineered between the
MBP gene or the nickel binding polypeptide and the identified expected gene of interest,
or portion thereof. Thus, the two polypeptides of the chimera can be separated from one
another by protease digestion.

One useful expression vector for generating maltose binding protein fusion
proteins is pMAL (New England Biolabs), which encodes the malE gene. In the pMAL
protein fusion system, the cloned gene is inserted into a pMAL vector downstream from the
malE gene. This results in the expression of an MBP-fusion protein. The fusion protein is
purified by affinity chromatography. These techniques as described are well known to
those skilled in the art of molecular biology.

EXAMPLE 7 
Production of an Antibody to an isolated E. coli Protein 
Substantially pure protein or polypeptide is isolated from the transformed cells as
described in Example 6. The concentration of protein in the final preparation is adjusted,
for example, by concentration on a 10,000 molecular weight cut off AMICON filter device
(Millipore, Bedford, MA), to the level of a few micrograms/ml. Monoclonal or polyclonal
antibody to the protein can then be prepared as follows:

Monoclonal Antibody Production by Hybridoma Fusion 
Monoclonal antibody to epitopes of any of the peptides identified and isolated as
described can be prepared from murine hybridomas according to the classical method of
Kohler, G. and Milstein, C., Nature 256:495 (1975) or any of the well-known derivative
methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the
selected protein or peptides derived therefrom over a period of a few weeks. The mouse is
then sacrificed, and the antibody producing cells of the spleen isolated. The spleen cells
are fused by means of polyethylene glycol with mouse myeloma cells, and the excess
unfused cells destroyed by growth of the system on selective media comprising
aminopterin (HAT media). The successfully fused cells are diluted and aliquots of the
dilution placed in wells of a microtiter plate where growth of the culture is continued.
Antibody-producing clones are identified by detection of antibody in the supernatant fluid
of the wells by immunoassay procedures, such as ELISA, as described by Engvall, E.,
"Enzyme immunoassay ELISA and EMIT," Meth. Enzymol. 70:419 (1980), and
derivative methods thereof. Selected positive clones can be expanded and their
monoclonal antibody product harvested for use. Detailed procedures for monoclonal
antibody production are described in Davis, L. et al. Basic Methods in Molecular Biology,
Elsevier, New York. Section 21-2.

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single
protein or a peptide can be prepared by immunizing suitable animals with the expressed
protein or peptides derived therefrom described above, which can be unmodified or
modified to enhance immunogenicity. Effective polyclonal antibody production is
affected by many factors related both to the antigen and the host species. For example,
small molecules tend to be less immunogenic than larger molecules and can require the use
of carriers and adjuvant. Also, host animals vary in response to site of inoculations and
dose, with both inadequate and excessive doses of antigen resulting in low titer antisera.
Small doses (ng level) of antigen administered at multiple intradermal sites appear to be
most reliable. An effective immunization protocol for rabbits can be found in Vaitukaitis,
J. et al. J. Clin. Endocrinol. Metab. 33:988-991 (1971).
Booster injections can be given at regular intervals, and antiserum harvested when
antibody titer thereof, as determined semi-quantitatively, for example, by double
immunodiffusion in agar against known concentrations of the antigen, begins to fall. See,
for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental
Immunology D. Weir (ed) Blackwell (1973). Plateau concentration of antibody is usually
in the range of 0.1 to 0.2 mg/ml of serum (about 12 µM). Affinity of the antisera for the
antigen is determined by preparing competitive binding curves, as described, for example,
by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman,
Eds.) Amer. Soc. For Microbiol., Washington, D.C. (1980).

Antibody preparations prepared according to either protocol are useful in
quantitative immunoassays which determine concentrations of antigen-bearing substances
in biological samples; they are also used semi-quantitatively or qualitatively to identify the
presence of antigen in a biological sample. The antibodies can also be used in therapeutic
compositions for killing bacterial cells expressing the protein.

EXAMPLE 8

Screening Chemical Libraries

A. Protein-Based Assays

Having isolated and expressed bacterial proteins shown to be required for bacterial
proliferation, the present invention further contemplates the use of these expressed proteins
in assays to screen libraries of compounds for potential drug candidates. The generation of
chemical libraries is well known in the art. For example, combinatorial chemistry can be
used to generate a library of compounds to be screened in the assays described herein. A
combinatorial chemical library is a collection of diverse chemical compounds generated by
either chemical synthesis or biological synthesis by combining a number of chemical
"building blocks" reagents. For example, a linear combinatorial chemical library such as a
polypeptide library is formed by combining amino acids in every possible combination to
yield peptides of a given length. Millions of chemical compounds theoretically can be
synthesized through such combinatorial mixings of chemical building blocks. For
example, one commentator observed that the systematic, combinatorial mixing of 100
interchangeable chemical building blocks results in the theoretical synthesis of 100 million
tetrameric compounds or 10 billion pentameric compounds. (Gallop et al., "Applications
of Combinatorial Technologies to Drug Discovery, Background and Peptide
Combinatorial Libraries," Journal of Medicinal Chemistry, Vol. 37, No. 9, 1233-1250
(1994)). Other chemical libraries known to those in the art may also be used, including
natural product libraries.
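
The library sizes quoted above follow from simple counting: if any of n interchangeable building blocks may occupy each of k positions in a linear compound, the library contains n^k members. A minimal sketch of that arithmetic follows; the function name is illustrative only.

```python
def linear_library_size(n_building_blocks: int, length: int) -> int:
    """Count distinct linear oligomers when any block may occupy any position."""
    if n_building_blocks < 1 or length < 1:
        raise ValueError("both arguments must be positive integers")
    return n_building_blocks ** length


tetramers = linear_library_size(100, 4)  # 100,000,000 (100 million)
pentamers = linear_library_size(100, 5)  # 10,000,000,000 (10 billion)
```

By the same counting, a peptide library built from the 20 common amino acids contains 20^k peptides of length k.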

Once generated, combinatorial libraries can be screened for compounds that
possess desirable biological properties. For example, compounds which may be useful as
drugs or to develop drugs would likely have the ability to bind to the target protein
identified, expressed and purified as discussed above. Further, if the identified target
protein is an enzyme, candidate compounds would likely interfere with the enzymatic
properties of the target protein. Any enzyme can be a target protein. For example, the
enzymatic function of a target protein can be to serve as a protease, nuclease, phosphatase,
dehydrogenase, transporter protein, transcriptional enzyme, and any other type of
enzyme known or unknown. Thus, the present invention contemplates using the protein
products described above to screen combinatorial chemical libraries.

Those in the art will appreciate that a number of techniques exist for characterizing
target proteins in order to identify molecules useful for the discovery and development of
therapeutics. For example, some techniques involve the generation and use of small
peptides to probe and analyze target proteins both biochemically and genetically in order
to identify and develop drug leads. Such techniques include the methods described in PCT
publications No. WO9935494, WO9819162, WO9954728, the disclosures of which are
incorporated herein by reference in their entireties.

In another example, the target protein is a serine protease and the substrate of the
enzyme is known. The present example is directed towards the analysis of libraries of
compounds to identify compounds that function as inhibitors of the target enzyme. First, a
library of small molecules is generated using methods of combinatorial library formation
well known in the art. U.S. Patent Nos. 5,463,564 and 5,574,656, to Agrafiotis, et al.,
entitled "System and Method of Automatically Generating Chemical Compound with
Desired Properties," are two such teachings. Then the library compounds are screened to
identify library compounds that possess desired structural and functional properties. U.S.
Patent No. 5,684,711 also discusses a method for screening libraries.

To illustrate the screening process, the combined target and chemical compounds
of the library are exposed to and permitted to interact with the purified enzyme. A labeled
substrate is added to the incubation. The label on the substrate is such that a detectable
signal is emitted from metabolized substrate molecules. The emission of this signal
permits one to measure the effect of the combinatorial library compounds on the
enzymatic activity of target enzymes. The characteristics of each library compound are
encoded so that compounds demonstrating activity against the enzyme can be analyzed
and features common to the various compounds identified can be isolated and combined
into future iterations of libraries.

Once a library of compounds is screened, subsequent libraries are generated using
those chemical building blocks that possess the features shown in the first round of screening
to have activity against the target enzyme. Using this method, subsequent iterations of
candidate compounds will possess more and more of those structural and functional
features required to inhibit the function of the target enzyme, until a group of enzyme
inhibitors with high specificity for the enzyme can be found. These compounds can then
be further tested for their safety and efficacy as antibiotics for use in mammals.

It will be readily appreciated that this particular screening methodology is
exemplary only. Other methods are well known to those skilled in the art. For
example, a wide variety of screening techniques are known for a large number of
naturally-occurring targets when the biochemical function of the target protein is
known.

B. Cell Based Assays 

Current cell-based assays used to identify or to characterize compounds for drug
discovery and development frequently depend on detecting the ability of a test
compound to inhibit the activity of a target molecule located within a cell or located on
the surface of a cell. Most often such target molecules are proteins such as enzymes,
receptors and the like. However, target molecules may also include other molecules
such as DNAs, lipids, carbohydrates and RNAs including messenger RNAs, ribosomal
RNAs, tRNAs and the like. A number of highly sensitive cell-based assay methods are
available to those of skill in the art to detect binding and interaction of test compounds
with specific target molecules. However, these methods are generally not highly
effective when the test compound binds to or otherwise interacts with its target
molecule with moderate or low affinity. In addition, the target molecule may not be
readily accessible to a test compound in solution, such as when the target molecule is
located inside the cell or within a cellular compartment such as the periplasm of a
bacterial cell. Thus, current cell-based assay methods are limited in that they are not
effective in identifying or characterizing compounds that interact with their targets with
moderate to low affinity or compounds that interact with targets that are not readily
accessible.

Cell-based assay methods of the present invention have substantial advantages
over current cell-based assays practiced in the art. These advantages derive from the
use of sensitized cells in which the level or activity of a proliferation-required gene
product (the target molecule) has been specifically reduced to the point where the
presence or absence of its function becomes a rate-determining step for cellular
proliferation. Bacterial, fungal, plant, or animal cells can all be used with the present
method. Such sensitized cells become much more sensitive to compounds that are
active against the affected target molecule. Thus, cell-based assays of the present
invention are capable of detecting compounds exhibiting low or moderate potency
against the target molecule of interest because such compounds are substantially more
potent on sensitized cells than on non-sensitized cells. The effect may be such that a
test compound may be two to several times more potent, at least 10 times more potent
or even at least 100 times more potent when tested on the sensitized cells as compared
to the non-sensitized cells.
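
One way to express the potency shift described above is as the ratio of the compound concentration needed to inhibit non-sensitized cells to the concentration needed to inhibit sensitized cells. The sketch below assumes potency is summarized as an IC50 value, which this specification does not prescribe; the function names and the example concentrations are hypothetical.

```python
def fold_sensitization(ic50_nonsensitized: float, ic50_sensitized: float) -> float:
    """Ratio of IC50 values; values above 1 mean the sensitized cells respond at lower doses."""
    if ic50_nonsensitized <= 0 or ic50_sensitized <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_nonsensitized / ic50_sensitized


def describe_shift(fold: float) -> str:
    """Bin a fold change into the ranges mentioned in the text."""
    if fold >= 100:
        return "at least 100 times more potent"
    if fold >= 10:
        return "at least 10 times more potent"
    if fold >= 2:
        return "two to several times more potent"
    return "no substantial shift"


# Hypothetical concentrations in the same units (for example, ug/ml).
shift = fold_sensitization(ic50_nonsensitized=32.0, ic50_sensitized=2.0)
label = describe_shift(shift)
```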

Due in part to the increased appearance of antibiotic resistance in pathogenic
microorganisms and to the significant side-effects associated with some currently used
antibiotics, novel antibiotics acting at new targets are highly sought after in the art. Yet
another limitation in the current art related to cell-based assays is the problem of
identifying hits against the same kinds of target molecules in the same limited set of
biological pathways over and over again. This may occur when compounds acting at
such new targets are discarded, ignored or fail to be detected because compounds acting
at the "old" targets are encountered more frequently and are more potent than
compounds acting at the new targets. As a result, the majority of antibiotics in use
currently interact with a relatively small number of target molecules within an even
more limited set of biological pathways.

The use of sensitized cells of the current invention provides a solution to the
above problem in two ways. First, desired compounds acting at a target of interest,
whether a new target or a previously known but poorly exploited target, can now be
detected above the "noise" of compounds acting at the "old" targets due to the specific
and substantial increase in potency of such desired compounds when tested on the
sensitized cells of the current invention. Second, the methods used to sensitize cells to
compounds acting at a target of interest may also sensitize these cells to compounds
acting at other target molecules within the same biological pathway. For example,
expression of an antisense molecule to a gene encoding a ribosomal protein is expected
to sensitize the cell to compounds acting at that ribosomal protein and may also
sensitize the cells to compounds acting at any of the ribosomal components (proteins or
rRNA) or even to compounds acting at any target which is part of the protein synthesis
pathway. Thus an important advantage of the present invention is the ability to reveal
new targets and pathways that were previously not readily accessible to drug discovery
methods.

Sensitized cells of the present invention are prepared by reducing the activity or
level of a target molecule. The target molecule may be a gene product, such as an RNA
or polypeptide produced from the proliferation-required nucleic acids described herein.
Alternatively, the target may be a gene product such as an RNA or polypeptide which is
produced from a sequence within the same operon as the proliferation-required nucleic
acids described herein. In addition, the target may be an RNA or polypeptide in the
same biological pathway as the proliferation-required nucleic acids described herein.
Such biological pathways include, but are not limited to, enzymatic, biochemical and
metabolic pathways as well as pathways involved in the production of cellular
structures such as the cell wall.

Current methods employed in the arts of medicinal and combinatorial
chemistries are able to make use of structure-activity relationship information derived
from testing compounds in various biological assays including direct binding assays and
cell-based assays. Occasionally compounds are directly identified in such assays that
are sufficiently potent to be developed as drugs. More often, initial hit compounds
exhibit moderate or low potency. Once a hit compound is identified with low or
moderate potency, directed libraries of compounds are synthesized and tested in order
to identify more potent leads. Generally these directed libraries are combinatorial
chemical libraries consisting of compounds with structures related to the hit compound
but containing systematic variations including additions, subtractions and substitutions
of various structural features. When tested for activity against the target molecule,
structural features are identified that either alone or in combination with other features
enhance or reduce activity. This information is used to design subsequent directed
libraries containing compounds with enhanced activity against the target molecule.
After one or several iterations of this process, compounds with substantially increased
activity against the target molecule are identified and may be further developed as
drugs. This process is facilitated by use of the sensitized cells of the present invention
since compounds acting at the selected targets exhibit increased potency in such
cell-based assays; thus, more compounds can now be characterized, providing more useful
information than would be obtained otherwise.

Thus, it is now possible using cell-based assays of the present invention to
identify or characterize compounds that previously would not have been readily
identified or characterized including compounds that act at targets that previously were
not readily exploited using cell-based assays. The process of evolving potent drug leads
from initial hit compounds is also substantially improved by the cell-based assays of the
present invention because, for the same number of test compounds, more
structure-function relationship information is likely to be revealed.

The method of sensitizing a cell entails selecting a suitable gene or operon. A
suitable gene or operon is one whose expression is required for the proliferation of the
cell to be sensitized. The next step is to introduce into the cells to be sensitized, an
antisense RNA capable of hybridizing to the suitable gene or operon or to the RNA
encoded by the suitable gene or operon. Introduction of the antisense RNA can be in
the form of an expression vector in which antisense RNA is produced under the control
of an inducible promoter. The amount of antisense RNA produced is limited by varying
the inducer concentration to which the cell is exposed and thereby varying the activity
of the promoter driving transcription of the antisense RNA. Thus, cells are sensitized
by exposing them to an inducer concentration that results in a sub-lethal level of
antisense RNA expression.

In one embodiment of the cell-based assays, the identified exogenous E. coli
nucleotide sequences of the present invention are used to inhibit the production of a
proliferation-required protein. Expression vectors producing antisense RNA against
identified genes required for proliferation are used to limit the concentration of a
proliferation-required protein without severely inhibiting growth. To achieve that goal, a
growth inhibition dose curve of inducer is calculated by plotting various doses of
inducer against the corresponding growth inhibition caused by the antisense expression.
From this curve, various percentages of antisense induced growth inhibition, from 1 to
100%, can be determined. If the promoter contained in the expression vector contains a
lac operator, the transcription is regulated by lac repressor and expression from the
promoter is inducible with IPTG. For example, the highest concentration of the inducer
IPTG that does not reduce the growth rate (0% growth inhibition) can be predicted from
the curve. Cellular proliferation can be monitored by growth medium turbidity via OD
measurements. In another example, the concentration of inducer that reduces growth by
25% can be predicted from the curve. In still another example, a concentration of
inducer that reduces growth by 50% can be calculated. Additional parameters such as
colony forming units (cfu) can be used to measure cellular viability.
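
Reading inducer concentrations off a growth inhibition dose curve can be sketched as a linear interpolation between measured points. The sketch below is illustrative only: the function names, the choice of linear interpolation and the data values are assumptions and are not measurements from this specification.

```python
def inducer_for_inhibition(doses, inhibitions, target):
    """Interpolate the inducer dose expected to give `target` percent growth inhibition.

    `doses` must be ascending and `inhibitions` non-decreasing.
    """
    points = list(zip(doses, inhibitions))
    for (d0, i0), (d1, i1) in zip(points, points[1:]):
        if i0 <= target <= i1 and i1 > i0:
            return d0 + (target - i0) * (d1 - d0) / (i1 - i0)
    raise ValueError("target inhibition lies outside the measured curve")


def max_noninhibitory_dose(doses, inhibitions, tolerance=0.0):
    """Highest tested dose whose growth inhibition does not exceed `tolerance` percent."""
    allowed = [d for d, i in zip(doses, inhibitions) if i <= tolerance]
    if not allowed:
        raise ValueError("every tested dose inhibits growth beyond the tolerance")
    return max(allowed)


# Hypothetical dose curve: IPTG in micromolar versus percent growth inhibition.
iptg_um = [0, 20, 40, 80, 160, 320]
inhibition_pct = [0, 0, 10, 30, 60, 100]

dose_0 = max_noninhibitory_dose(iptg_um, inhibition_pct)       # 20
dose_25 = inducer_for_inhibition(iptg_um, inhibition_pct, 25)  # 70.0
dose_50 = inducer_for_inhibition(iptg_um, inhibition_pct, 50)  # about 133.3
```

In practice a fitted sigmoidal dose-response model may be preferable to piecewise linear interpolation when the curve is sparsely sampled.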

Cells to be assayed are exposed to the above-determined concentrations of
inducer. The presence of the inducer at this sub-lethal concentration reduces the
amount of the proliferation required gene product to the lowest amount in the cell that
will support growth. Cells grown in the presence of this concentration of inducer are
therefore specifically more sensitive to inhibitors of the proliferation-required protein or
RNA of interest or to inhibitors of proteins or RNAs in the same biological pathway as
the proliferation-required protein or RNA of interest but not to inhibitors of unrelated
proteins or RNAs.

Cells pretreated with sub-inhibitory concentrations of inducer and thus containing
a reduced amount of proliferation-required target gene product are then used to screen
for compounds that reduce cell growth. The sub-lethal concentration of inducer may be
any concentration consistent with the intended use of the assay to identify candidate
compounds to which the cells are more sensitive. For example, the sub-lethal
concentration of the inducer may be such that growth inhibition is at least about 5%, at
least about 8%, at least about 10%, at least about 20%, at least about 30%, at least about
40%, at least about 50%, at least about 60%, at least about 75%, or more. Cells which
are pre-sensitized using the preceding method are more sensitive to inhibitors of the
target protein because these cells contain less target protein to inhibit than wild-type
cells.
In another embodiment of the cell based assays of the present invention, the level or
activity of a proliferation required gene product is reduced using a temperature sensitive
mutation in the proliferation-required sequence and an antisense nucleic acid against
the proliferation-required sequence. Growing the cells at an intermediate temperature
between the permissive and restrictive temperatures of the temperature sensitive mutant
where the mutation is in a proliferation-required gene produces cells with reduced
activity of the proliferation-required gene product. The antisense RNA directed against
the proliferation-required sequence further reduces the activity of the proliferation
required gene product. Drugs that may not have been found using either the
temperature sensitive mutation or the antisense nucleic acid alone may be identified by
determining whether cells in which expression of the antisense nucleic acid has been
induced and which are grown at a temperature between the permissive temperature and
the restrictive temperature are substantially more sensitive to a test compound than cells
in which expression of the antisense nucleic acid has not been induced and which are
grown at a permissive temperature. Also drugs found previously from either the
antisense nucleic acid alone or the temperature sensitive mutation alone may have a
different sensitivity profile when used in cells combining the two approaches, and that
sensitivity profile may indicate a more specific action of the drug in inhibiting one or
more activities of the gene product.

Temperature sensitive mutations may be located at different sites within the
gene and correspond to different domains of the protein. For example, the dnaB gene of
Escherichia coli encodes the replication fork DNA helicase. DnaB has several
domains, including domains for oligomerization, ATP hydrolysis, DNA binding,
interaction with primase, interaction with DnaC, and interaction with DnaA [(Biswas,
E.E. and Biswas, S.B. 1999. Mechanism and DnaB helicase of Escherichia coli:
structural domains involved in ATP hydrolysis, DNA binding, and oligomerization.
Biochem. 38:10919-10928; Hiasa, H. and Marians, K.J. 1999. Initiation of bidirectional
replication at the chromosomal origin is directed by the interaction between helicase
and primase. J. Biol. Chem. 274:27244-27248; San Martin, C., Radermacher, M.,
Wolpensinger, B., Engel, A., Miles, C.S., Dixon, N.E., and Carazo, J.M. 1998.
Three-dimensional reconstructions from cryoelectron microscopy images reveal an intimate
complex between helicase DnaB and its loading partner DnaC. Structure 6:501-9;
Sutton, M.D., Carr, K.M., Vicente, M., and Kaguni, J.M. 1998. Escherichia coli DnaA
protein. The N-terminal domain and loading of DnaB helicase at the E. coli
chromosomal. J. Biol. Chem. 273:34255-62.), the disclosures of which are incorporated
herein by reference in their entireties]. Temperature sensitive mutations in different
domains of DnaB confer different phenotypes at the restrictive temperature, which
include either an abrupt stop or slow stop in DNA replication with or without DNA
breakdown (Wechsler, J.A. and Gross, J.D. 1971. Escherichia coli mutants
temperature-sensitive for DNA synthesis. Mol. Gen. Genetics 113:273-284, the disclosure of which
is incorporated herein by reference in its entirety) and termination of growth or cell
death. Combining the use of temperature sensitive mutations in the dnaB gene that
cause cell death at the restrictive temperature with an antisense to the dnaB gene could
lead to the discovery of very specific and effective inhibitors of one or a subset of
activities exhibited by DnaB.

When screening for antimicrobial agents against a gene product required for
proliferation, growth inhibition of cells containing a limiting amount of that
proliferation-required gene product can be assayed. Growth inhibition can be measured
by directly comparing the amount of growth, measured by the optical density of the
growth medium, between an experimental sample and a control sample. Alternative
methods for assaying cell proliferation include measuring green fluorescent protein
(GFP) reporter construct emissions, various enzymatic activity assays, and other
methods well known in the art.
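
The comparison of optical density between an experimental sample and a control sample can be expressed as a percent growth inhibition. The following is a minimal sketch; the blank subtraction step, the function name and the example readings are assumptions introduced for illustration.

```python
def growth_inhibition_percent(od_treated: float, od_control: float,
                              od_blank: float = 0.0) -> float:
    """Percent reduction in growth of a treated culture relative to an untreated control.

    `od_blank` is the reading of sterile medium and is subtracted from both cultures.
    """
    treated = od_treated - od_blank
    control = od_control - od_blank
    if control <= 0:
        raise ValueError("control culture shows no growth above the blank")
    return 100.0 * (1.0 - treated / control)


# Hypothetical readings: treated well 0.35, untreated control 0.65, medium blank 0.05.
inhibition = growth_inhibition_percent(0.35, 0.65, od_blank=0.05)  # about 50 percent
```

Running the same calculation on sensitized and non-sensitized cells exposed to the same compound concentration gives a direct comparison of the two responses.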

It will be appreciated that the above method may be performed in solid phase,
liquid phase or a combination of the two. For example, cells grown on nutrient agar
containing the inducer of the antisense construct may be exposed to compounds spotted
onto the agar surface. A compound's effect may be judged from the diameter of the
resulting killing zone, the area around the compound application point in which cells do
not grow. Multiple compounds may be transferred to agar plates and simultaneously
tested using automated and semi-automated equipment including but not restricted to
multi-channel pipettes (for example the Beckman Multimek) and multi-channel spotters
(for example the Genomic Solutions Flexys). In this way multiple plates and thousands
to millions of compounds may be tested per day.
The compounds may also be tested entirely in liquid phase using microtiter
plates as described below. Liquid phase screening may be performed in microtiter
plates containing 96, 384, 1536 or more wells per microtiter plate to screen multiple
plates and thousands to millions of compounds per day. Automated and
semi-automated equipment may be used for addition of reagents (for example cells and
compounds) and determination of cell density.
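
The relationship between plate format and screening capacity can be estimated with simple arithmetic. The sketch below is illustrative only: the function name, the number of wells reserved for controls and the library size are hypothetical planning figures, not values from this specification.

```python
import math


def plates_required(n_compounds: int, wells_per_plate: int,
                    control_wells: int = 0, replicates: int = 1) -> int:
    """Number of microtiter plates needed to test each compound `replicates` times."""
    usable = wells_per_plate - control_wells
    if usable <= 0:
        raise ValueError("no wells remain for test compounds")
    return math.ceil(n_compounds * replicates / usable)


# Hypothetical library of 100,000 compounds, one well per compound.
plates_96 = plates_required(100_000, 96)                       # 1042
plates_384 = plates_required(100_000, 384, control_wells=32)   # 285
plates_1536 = plates_required(100_000, 1536)                   # 66
```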

EXAMPLE 9 

The effectiveness of the above cell based assay was validated using constructs
expressing antisense RNA to E. coli genes rplL, rplJ, and rplW encoding ribosomal
proteins L7/L12, L10 and L23 respectively. These proteins are part of the protein
synthesis apparatus of the cell and as such are required for proliferation. These
constructs were used to test the effect of antisense expression on cell sensitivity to
antibiotics known to bind to the ribosome and thereby inhibit protein synthesis.
Constructs expressing antisense RNA to several other genes (elaD, visC, yohH, and
aptE/B), the products of which are not involved in protein synthesis, were used for
comparison.

First, expression vectors containing antisense constructs to either rplW or to elaD
were introduced into separate E. coli cell populations. Vector introduction is a
technique well known to those of ordinary skill in the art. The expression vectors of
this example contain IPTG inducible promoters that drive the expression of the
antisense RNA in the presence of the inducer. However, those skilled in the art will
appreciate that other inducible promoters may also be used. Suitable expression vectors
are also well known in the art. The E. coli antisense clones directed against the genes
encoding ribosomal proteins L7/L12, L10 and L23 were used to test the effect of
antisense expression on cell sensitivity to the antibiotics known to bind to these
proteins. First, expression vectors containing antisense to either the genes encoding
L7/L12 and L10 or L23 were introduced into separate E. coli cell populations.

The cell populations were exposed to a range of IPTG concentrations in liquid
medium to obtain the growth inhibitory dose curve for each clone (Fig. 1). First, seed
cultures were grown to a particular turbidity that is measured by the optical density
(OD) of the growth solution. The OD of the solution is directly related to the number of
bacterial cells contained therein. Subsequently, sixteen 200 µl liquid medium cultures
were grown in a 96 well microtiter plate at 37°C with a range of IPTG concentrations in
duplicate two-fold serial dilutions from 1600 µM to 12.5 µM (final concentration).
Additionally, control cells were grown in duplicate without IPTG. These cultures were
started from equal amounts of cells derived from the same initial seed culture of a clone
of interest. The cells were grown for up to 15 hours and the extent of growth was
determined by measuring the optical density of the cultures at 600 nm. When the
control culture reached mid-log phase the percent growth of the control for each of the
IPTG containing cultures was plotted against the log concentrations of IPTG to produce
a growth inhibitory dose response curve for the IPTG. The concentration of IPTG that
inhibits cell growth to 50% (IC50) as compared to the 0 mM IPTG control (0% growth
inhibition) was then calculated from the curve. Under these conditions, an amount of
antisense RNA was produced that reduced the expression levels of rplW and elaD to a
degree such that growth was inhibited by 50%.
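
The dilution scheme and the IC50 readout described above can be sketched as follows. This is a minimal illustration only: the function names are hypothetical, the growth values are invented for the sketch and are not data from this example, and linear interpolation on log concentration is one assumed way of reading an IC50 off a dose response curve, not necessarily the calculation used here.

```python
import math

def twofold_series(top, n):
    """Return n concentrations starting at `top`, halving at each step."""
    return [top / 2 ** i for i in range(n)]

def ic50_from_curve(concs, pct_growth):
    """Estimate the concentration giving 50% of control growth by linear
    interpolation on log10(concentration) between the two bracketing points."""
    pts = sorted(zip(concs, pct_growth))
    for (c_lo, g_lo), (c_hi, g_hi) in zip(pts, pts[1:]):
        if (g_lo - 50.0) * (g_hi - 50.0) <= 0 and g_lo != g_hi:
            frac = (g_lo - 50.0) / (g_lo - g_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None  # the curve never crosses 50% of control growth

# Eight two-fold steps from 1600 uM down to 12.5 uM; in duplicate this gives
# the sixteen cultures mentioned in the text.
iptg_um = twofold_series(1600.0, 8)

# Hypothetical percent-of-control growth at each IPTG concentration
# (illustrative numbers only, listed from 1600 uM down to 12.5 uM).
growth = [10, 20, 35, 60, 80, 90, 95, 98]

ic50 = ic50_from_curve(iptg_um, growth)
```

With these invented values the estimate falls between 200 µM and 400 µM, the two concentrations that bracket 50% growth.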

Alternative methods of measuring growth are also contemplated. Examples of
these methods include measurements of proteins, the expression of which is engineered
into the cells being tested and can readily be measured. Examples of such proteins
include green fluorescent protein (GFP) and various enzymes.

Cells were pretreated with the selected concentration of IPTG and then used to
test the sensitivity of cell populations to tetracycline, erythromycin and other protein
synthesis inhibitors. An example of a tetracycline dose response curve is shown in
Figures 2A and 2B for the rplW and elaD genes, respectively. Cells were grown to log
phase and then diluted into media alone or media containing IPTG at concentrations
which give 20% and 50% growth inhibition as determined by IPTG dose response
curves. After 2.5 hours, the cells were diluted to a final OD600 of 0.002 into 96 well
plates containing (1) +/- IPTG at the same concentrations used for the 2.5 hour
pre-incubation; and (2) serial two-fold dilutions of tetracycline such that the final
concentrations of tetracycline range from 1 µg/ml to 15.6 ng/ml and 0 µg/ml. The 96
well plates were incubated at 37°C and the OD600 was read by a plate reader every 5
minutes for up to 15 hours. For each IPTG concentration and the no IPTG control,
tetracycline dose response curves were determined when the control (absence of
tetracycline) reached 0.1 OD600. To compare tetracycline sensitivity with and without
IPTG, tetracycline IC50s were determined from the dose response curves (Figs. 2A-B).

Cells with reduced levels of L23 (rplW) showed increased sensitivity to tetracycline
(Fig. 2A) as compared to cells with reduced levels of elaD (Fig. 2B). Figure 3 shows a
summary bar chart in which the ratios of tetracycline IC50s determined in the presence
of IPTG which gives 50% growth inhibition versus tetracycline IC50s determined
without IPTG (fold increase in tetracycline sensitivity) were plotted. Cells with reduced
levels of either L7/L12 (genes rplL, rplJ) or L23 (rplW) showed increased sensitivity to
tetracycline (Fig. 3). Cells expressing antisense to genes not known to be involved in
protein synthesis (atpB/E, visC, elaD, yohH) did not show the same increased
sensitivity to tetracycline, validating the specificity of this assay (Fig. 3).
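
The comparison of IC50s with and without inducer can be expressed as a simple ratio. The sketch below assumes that "fold increase in sensitivity" means the IC50 measured without IPTG divided by the IC50 measured with IPTG, so that values above 1 indicate sensitization. The function name and the IC50 numbers are hypothetical and are not taken from Figures 2 or 3.

```python
def fold_sensitization(ic50_uninduced, ic50_induced):
    """Fold increase in sensitivity, taken here (as an assumption) to be the
    IC50 without inducer divided by the IC50 with inducer."""
    if ic50_uninduced <= 0 or ic50_induced <= 0:
        raise ValueError("IC50 values must be positive")
    return ic50_uninduced / ic50_induced

# Hypothetical tetracycline IC50s in ng/ml as (without IPTG, with IPTG).
# These are invented numbers for illustration, not results from this example.
example_ic50s = {
    "rplW": (400.0, 100.0),
    "elaD": (400.0, 380.0),
}

folds = {gene: fold_sensitization(*pair) for gene, pair in example_ic50s.items()}
```

In this invented case the rplW clone would show a four-fold shift while the elaD clone would show essentially none, which is the pattern the assay is designed to detect.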

In addition to the above, it has been observed in initial experiments that clones
expressing antisense RNA to genes involved in protein synthesis (including genes
encoding ribosomal proteins L7/L12 & L10, L7/L12 alone, L22, and L18, as well as
genes encoding rRNA and Elongation Factor G) have increased sensitivity to the
macrolide, erythromycin, whereas clones expressing antisense to the non-protein
synthesis genes elaD, atpB/E and visC do not. Furthermore, the clone expressing
antisense to rplL and rplJ does not show increased sensitivity to nalidixic acid and
ofloxacin, antibiotics which do not inhibit protein synthesis.

The results with the ribosomal protein genes rplL, rplJ, and rplW as well as the
initial results using various other antisense clones and antibiotics show that limiting the
concentration of an antibiotic target makes cells more sensitive to the antimicrobial
agents that specifically interact with that protein. The results also show that these cells
are sensitized to antimicrobial agents that inhibit the overall function in which the
protein target is involved but are not sensitized to antimicrobial agents that inhibit other
functions.

The cell based assay described above may also be used to identify the biological
pathway in which a proliferation-required nucleic acid or its gene product lies. In such
methods, cells expressing a sub-lethal level of antisense to a target
proliferation-required nucleic acid and control cells in which expression of the antisense
has not been induced are contacted with a panel of antibiotics known to act in various
pathways. If the antibiotic acts in the pathway in which the target proliferation-required
nucleic acid or its gene product lies, cells in which expression of the antisense has been
induced will be more sensitive to the antibiotic than cells in which expression of the
antisense has not been induced.

As a control, the results of the assay may be confirmed by contacting a panel of
cells expressing antisense nucleic acids to many different proliferation-required genes
including the target proliferation-required gene. If the antibiotic is acting specifically,
heightened sensitivity to the antibiotic will be observed only in the cells expressing
antisense to a target proliferation-required gene (or cells expressing antisense to other
proliferation-required genes in the same pathway as the target proliferation-required
gene) but will not be observed generally in all cells expressing antisense to
proliferation-required genes.

Similarly, the above method may be used to determine the pathway on which a
test antibiotic acts. A panel of cells, each of which expresses antisense to a
proliferation-required nucleic acid in a known pathway, is contacted with a compound
for which it is desired to determine the pathway on which it acts. The sensitivity of the
panel of cells to the test compound is determined in cells in which expression of the
antisense has been induced and in control cells in which expression of the antisense has
not been induced. If the test antibiotic acts on the pathway on which an antisense
nucleic acid acts, cells in which expression of the antisense has been induced will be
more sensitive to the antibiotic than cells in which expression of the antisense has not
been induced. In addition, control cells in which expression of antisense to
proliferation-required genes in other pathways has been induced will not exhibit
heightened sensitivity to the antibiotic. In this way, the pathway on which the test
antibiotic acts may be determined.

The Example below provides one method for performing such assays.

EXAMPLE 10

Identification of the Pathway in which a Proliferation-Required
Gene Lies or the Pathway on which an Antibiotic Acts

A. Preparation of Bacterial Stocks for Assay

To provide a consistent source of cells to screen, frozen stocks of host bacteria
containing the desired antisense construct are prepared using standard microbiological
techniques. For example, a single clone of the organism can be isolated by streaking out
a sample of the original stock onto an agar plate containing nutrients for cell growth and
an antibiotic for which the antisense construct contains a gene which confers resistance.
After overnight growth an isolated colony is picked from the plate with a sterile needle
and transferred to an appropriate liquid growth medium containing the antibiotic required
for maintenance of the plasmid. The cells are incubated at 30°C to 37°C with vigorous
shaking for 4 to 6 hours to yield a culture in exponential growth. Sterile glycerol is
added to 15% (volume to volume) and 100 µL to 500 µL aliquots are distributed into
sterile cryotubes, snap frozen in liquid nitrogen, and stored at -80°C for future assays.

B. Growth of Bacteria for Use in the Assay 

A day prior to an assay, a stock vial is removed from the freezer, rapidly thawed
(37°C water bath) and a loop of culture is streaked out on an agar plate containing
nutrients for cell growth and an antibiotic to which the antisense construct confers
resistance. After overnight growth at 37°C, ten randomly chosen, isolated colonies are
transferred from the plate (sterile inoculum loop) to a sterile tube containing 5 mL of
LB medium containing the antibiotic to which the antisense vector confers resistance.
After vigorous mixing to form a homogeneous cell suspension, the optical density of the
suspension is measured at 600 nm (OD600) and if necessary an aliquot of the
suspension is diluted into a second tube of 5 mL of sterile LB medium plus antibiotic to
achieve an OD600 < 0.02 absorbance units. The culture is then incubated at 37°C for
1-2 hrs with shaking until the OD600 reaches 0.2 - 0.3. At this point the cells are
ready to be used in the assay.

C. Selection of Media to be Used in Assay 

Two-fold dilution series of the inducer are generated in culture media containing
the appropriate antibiotic for maintenance of the antisense construct. Several media are
tested side by side and three to four wells are used to evaluate the effects of the inducer
at each concentration in each medium. For example, M9 minimal media, LB broth, TBD
broth and Mueller-Hinton media may be tested with the inducer IPTG at the following
concentrations: 50 µM, 100 µM, 200 µM, 400 µM, 600 µM, 800 µM and 1000 µM.
Equal volumes of test media-inducer and cells are added to the wells of a 384 well
microtiter plate and mixed. The cells are prepared as described above and diluted 1:100
in the appropriate media containing the test antibiotic immediately prior to addition to
the microtiter plate wells. For a control, cells are also added to several wells of each
medium that do not contain inducer, for example 0 µM IPTG. Cell growth is monitored
continuously by incubation at 37°C in a microtiter plate reader monitoring the OD600
of the wells over an 18-hour period. The percent inhibition of growth produced by each
concentration of inducer is calculated by comparing the rates of logarithmic growth
against that exhibited by cells growing in media without inducer. The medium yielding
greatest sensitivity to inducer is selected for use in the assays described below.
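
One way to turn plate reader traces into the percent inhibition described above is sketched below. It is only an illustration under stated assumptions: the rate of logarithmic growth is taken as the least-squares slope of ln(OD600) against time, the OD600 window treated as exponential phase (0.02 to 0.3) is an arbitrary choice for the sketch, and the function names and the synthetic traces are hypothetical.

```python
import math

def log_growth_rate(times_h, od600, od_min=0.02, od_max=0.3):
    """Least-squares slope of ln(OD600) versus time (per hour), using only
    readings inside an assumed exponential window [od_min, od_max]."""
    pts = [(t, math.log(od)) for t, od in zip(times_h, od600)
           if od_min <= od <= od_max]
    if len(pts) < 2:
        raise ValueError("need at least two readings in the exponential window")
    n = len(pts)
    mean_t = sum(t for t, _ in pts) / n
    mean_y = sum(y for _, y in pts) / n
    sxx = sum((t - mean_t) ** 2 for t, _ in pts)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in pts)
    return sxy / sxx

def percent_inhibition(rate_treated, rate_control):
    """Percent reduction in growth rate relative to the no-inducer control."""
    return 100.0 * (1.0 - rate_treated / rate_control)

# Synthetic traces for illustration: the control grows at 1.2 per hour and
# the induced culture at 0.6 per hour (invented values, not measured data).
times = [0.0, 0.5, 1.0, 1.5, 2.0]
control = [0.02 * math.exp(1.2 * t) for t in times]
induced = [0.02 * math.exp(0.6 * t) for t in times]

inhibition = percent_inhibition(log_growth_rate(times, induced),
                                log_growth_rate(times, control))
```

Because the synthetic induced culture grows at half the control rate, the computed inhibition is 50%.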

D. Measurement of Test Antibiotic Sensitivity in the Absence of Antisense Construct 
Induction 

Two-fold dilution series of antibiotics of known mechanism of action are
generated in the culture media selected for further assay development that has been
supplemented with the antibiotic used to maintain the construct. A panel of test
antibiotics known to act on different pathways is tested side by side with three to four
wells being used to evaluate the effect of a test antibiotic on cell growth at each
concentration. Equal volumes of test antibiotic and cells are added to the wells of a 384
well microtiter plate and mixed. Cells are prepared as described above using the media
selected for assay development supplemented with the antibiotic required to maintain
the antisense construct and are diluted 1:100 in identical media immediately prior to
addition to the microtiter plate wells. For a control, cells are also added to several wells
that contain the solvent used to dissolve the antibiotics but no antibiotic. Cell growth is
monitored continuously by incubation at 37°C in a microtiter plate reader monitoring
the OD600 of the wells over an 18-hour period. The percent inhibition of growth
produced by each concentration of antibiotic is calculated by comparing the rates of
logarithmic growth against that exhibited by cells growing in media without antibiotic.
A plot of percent inhibition against log[antibiotic concentration] allows extrapolation of
an IC50 value for each antibiotic.

E. Measurement of Test Antibiotic Sensitivity in the Presence of Antisense Construct
Inducer

The culture media selected for use in the assay is supplemented with inducer at
concentrations shown to inhibit cell growth by 50% and 80% as described above and the
antibiotic used to maintain the construct. Two-fold dilution series of the panel of test
antibiotics used above are generated in each of these media. Several antibiotics are
tested side by side with three to four wells being used to evaluate the effects of an
antibiotic on cell growth at each concentration, in each medium. Equal volumes of test
antibiotic and cells are added to the wells of a 384 well microtiter plate and mixed.
Cells are prepared as described above using the media selected for use in the assay
supplemented with the antibiotic required to maintain the antisense construct. The cells
are diluted 1:100 into two 50 mL aliquots of identical media containing concentrations
of inducer that have been shown to inhibit cell growth by 50% and 80% respectively
and incubated at 37°C with shaking for 2.5 hours. Immediately prior to addition to the
microtiter plate wells, the cultures are adjusted to an appropriate OD600 (typically 0.002)
by dilution into warm (37°C) sterile media supplemented with identical concentrations
of the inducer and antibiotic used to maintain the antisense construct. For a control,
cells are also added to several wells that contain solvent used to dissolve test antibiotics
but which contain no antibiotic. Cell growth is monitored continuously by incubation at
37°C in a microtiter plate reader monitoring the OD600 of the wells over an 18-hour
period. The percent inhibition of growth produced by each concentration of antibiotic is
calculated by comparing the rates of logarithmic growth against that exhibited by cells
growing in media without antibiotic. A plot of percent inhibition against log[antibiotic
concentration] allows extrapolation of an IC50 value for each antibiotic.

F. Determining the Specificity of the Test Antibiotics 

A comparison of the IC50s generated by antibiotics of known mechanism of
action under antisense induced and non-induced conditions allows the pathway in which
a proliferation-required nucleic acid lies to be identified. If cells expressing an
antisense nucleic acid against a proliferation-required gene are selectively sensitive to
an antibiotic acting via a particular pathway, then the gene against which the antisense
acts is involved in the pathway in which the antibiotic acts.

G. Identification of Pathway in which a Test Antibiotic Acts

As discussed above, the cell based assay may also be used to determine the
pathway against which a test antibiotic acts. In such an analysis, the pathways against
which each member of a panel of antisense nucleic acids acts are identified as described
above. A panel of cells, each containing an inducible antisense vector against a gene in
a known proliferation-required pathway, is contacted with a test antibiotic for which it
is desired to determine the pathway on which it acts under inducing and non-inducing
conditions. If heightened sensitivity is observed in induced cells expressing antisense
against a gene in a particular pathway but not in induced cells expressing antisense
against genes in other pathways, then the test antibiotic acts against the pathway for
which heightened sensitivity was observed.
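
The panel comparison described above amounts to asking which antisense clones show a selective drop in IC50 on induction. The sketch below is a hedged illustration of that logic only. The two-fold cutoff is an arbitrary assumption standing in for whatever statistical criterion is actually applied, and the pathway names, IC50 values and function name are hypothetical.

```python
def sensitized_pathways(panel, min_fold=2.0):
    """panel maps pathway name -> (IC50 non-induced, IC50 induced).
    Returns the pathways whose IC50 dropped by at least `min_fold` on
    induction. The cutoff is an assumed placeholder, not a value from the text."""
    hits = []
    for pathway, (ic50_off, ic50_on) in panel.items():
        if ic50_off / ic50_on >= min_fold:
            hits.append(pathway)
    return sorted(hits)

# Hypothetical IC50s (arbitrary units) for one test antibiotic against a
# panel of antisense clones, one clone per pathway. Invented for illustration.
panel = {
    "protein synthesis": (8.0, 1.0),
    "DNA replication": (8.0, 7.5),
    "cell wall synthesis": (8.0, 8.2),
}

hits = sensitized_pathways(panel)
```

Here only the clone in the protein synthesis pathway is sensitized, so under the reasoning above the test antibiotic would be assigned to that pathway. If several pathways or none pass the cutoff, the result is inconclusive and the cutoff or assay conditions would need revisiting.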

One skilled in the art will appreciate that further optimization of the assay
conditions, such as the concentration of inducer used to induce antisense expression
and/or the growth conditions used for the assay (for example incubation temperature
and media components) may further increase the selectivity and/or magnitude of the
antibiotic sensitization exhibited.

The following example confirms the effectiveness of the methods described
above.

EXAMPLE 11

Identification of the Pathway in which a Proliferation-Required Gene Lies

Antibiotics of various chemical classes and modes of action were purchased
from Sigma Chemicals (St. Louis, MO). Stock solutions were prepared by dissolving
each antibiotic in an appropriate aqueous solution based on information provided by the
manufacturer. The final working solution of each antibiotic contained no more than
0.2% (w/v) of any organic solvent. To determine their potency against a bacterial strain
engineered for expression of an antisense against a proliferation-required 50S ribosomal
protein, each antibiotic was serially diluted two or three fold in growth medium
supplemented with the appropriate antibiotic for maintenance of the anti-sense
construct. At least ten dilutions were prepared for each antibiotic. 25 µL aliquots of
each dilution were transferred to discrete wells of a 384-well microplate (the assay
plate) using a multi-channel pipette. Quadruplicate wells were used for each dilution of
an antibiotic under each treatment condition (plus and minus inducer). Each assay plate
contained twenty wells for cell growth controls (growth media replacing antibiotic), ten
wells for each treatment (plus and minus inducer, in this example IPTG). Assay plates
were usually divided into the two treatments: half the plate containing induced cells and
an appropriate concentration of inducer (in this example IPTG) to maintain the state of
induction, the other half containing non-induced cells in the absence of IPTG.
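
The well budget implied by this layout can be checked with a few lines of arithmetic. The sketch assumes exactly ten dilutions per antibiotic (the text says at least ten) and that all wells other than the twenty growth controls are available for antibiotic dilutions. The function names are hypothetical.

```python
def wells_per_antibiotic(n_dilutions, replicates=4, treatments=2):
    """Wells used by one antibiotic: dilutions x replicates x treatments."""
    return n_dilutions * replicates * treatments

def antibiotics_per_plate(n_dilutions, plate_wells=384, control_wells=20):
    """Whole antibiotics that fit on one plate after reserving control wells."""
    usable = plate_wells - control_wells
    return usable // wells_per_antibiotic(n_dilutions)

per_drug = wells_per_antibiotic(10)    # ten dilutions, quadruplicate, +/- inducer
capacity = antibiotics_per_plate(10)   # antibiotics per 384-well plate
```

Under these assumptions each antibiotic occupies 80 wells, so a 384-well plate with twenty control wells holds four antibiotics.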

Cells for the assay were prepared as follows. Bacterial cells containing a
construct, from which expression of antisense nucleic acid against rplL and rplJ, which
encode proliferation-required 50S ribosomal subunit proteins, is inducible in the
presence of IPTG, were grown into exponential growth (OD600 0.2 to 0.3) and then
diluted 1:100 into fresh media containing either 400 µM or 0 µM inducer (IPTG). These
cultures were incubated at 37°C for 2.5 hr. After a 2.5 hr incubation, induced and
non-induced cells were respectively diluted into an assay medium at a final OD600 value of
0.0004. The medium contained an appropriate concentration of the antibiotic for the
maintenance of the anti-sense construct. In addition, the medium used to dilute induced
cells was supplemented with 800 µM IPTG so that addition to the assay plate would
result in a final IPTG concentration of 400 µM. Induced and non-induced cell
suspensions were dispensed (25 µl/well) into the appropriate wells of the assay plate as
discussed previously. The plate was then loaded into a plate reader, incubated at
constant temperature, and cell growth was monitored in each well by the measurement
of light scattering at 595 nm. Growth was monitored every 5 minutes until the cell
culture attained a stationary growth phase. For each concentration of antibiotic, a
percentage inhibition of growth was calculated at the time point corresponding to
mid-exponential growth for the associated control wells (no antibiotic, plus or minus IPTG).
For each antibiotic and condition (plus or minus IPTG), a plot of percent inhibition
versus log of antibiotic concentration was generated and the IC50 determined. A
comparison of the IC50 for each antibiotic in the presence and absence of IPTG revealed
whether induction of the antisense construct sensitized the cell to the mechanism of
action exhibited by the antibiotic. Cells which exhibited a significant (standard
statistical analysis) numerical decrease in the IC50 value in the presence of inducer were
considered to have an increased sensitivity to the test antibiotic.
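
The inducer concentration in the cell diluent follows from the equal-volume mixing on the assay plate. The short sketch below works through that dilution, assuming the 800 figure is in µM (consistent with the stated final concentration of 400 µM) and that each well receives 25 µL of antibiotic dilution plus 25 µL of cell suspension. The function name and the antibiotic concentration are hypothetical.

```python
def final_concentration(stock_conc, stock_vol, other_vol):
    """Concentration after mixing `stock_vol` of a solution at `stock_conc`
    with `other_vol` of a solution that contains none of that solute."""
    return stock_conc * stock_vol / (stock_vol + other_vol)

# 25 uL of induced cells in medium with 800 uM IPTG is added to 25 uL of
# antibiotic dilution that contains no IPTG.
iptg_final_um = final_concentration(800.0, 25.0, 25.0)

# The antibiotic is diluted two-fold by the same mixing step. The 2.0 ug/ml
# starting value is a hypothetical number for illustration only.
antibiotic_final = final_concentration(2.0, 25.0, 25.0)
```

The same factor of two applies to every antibiotic dilution on the plate, so concentrations prepared in the 25 µL aliquots are twice the final assay concentrations.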

The results are provided in the table below, which lists the classes and names of
the antibiotics used in the analysis, the targets of the antibiotics, the IC50 in the absence
of IPTG, the IC50 in the presence of IPTG, the concentration units for the IC50s, the
fold increase in IC50 in the presence of IPTG, and whether increased sensitivity was
observed in the presence of IPTG.

The above results demonstrate that induction of an antisense RNA to genes
encoding 50S ribosomal subunit proteins results in a selective and highly significant
sensitization of cells to antibiotics that inhibit ribosomal function and protein synthesis.
The above results further demonstrate that induction of an antisense construct to an
essential gene sensitizes an organism to compounds that interfere with that gene
product's biological role. This sensitization is restricted to compounds that interfere
with pathways associated with the targeted gene and its product.

Assays utilizing antisense constructs to essential genes can be used to identify
compounds that specifically interfere with the activity of multiple targets in a pathway.
Such constructs can be used to simultaneously screen a sample against multiple targets
in one pathway in one reaction (Combinatorial HTS).

Furthermore, as discussed above, panels of cells containing antisense constructs
may be used to characterize the point of intervention of any compound affecting an
essential biological pathway including antibiotics with no known mechanism of action.

Another embodiment of the present invention is a method for determining the
pathway against which a test antibiotic compound is active in which the activity of
target proteins or nucleic acids involved in proliferation-required pathways is reduced
by contacting cells with a sublethal concentration of a known antibiotic which acts
against the target protein or nucleic acid. In one embodiment, the target protein or
nucleic acid is a target protein or nucleic acid corresponding to a proliferation-required
nucleic acid identified using the methods described above. The method is similar to
those described above for determining which pathway a test antibiotic acts against
except that rather than reducing the activity or level of a proliferation-required gene
product using a sublethal level of antisense to a proliferation-required nucleic acid, the
activity or level of the proliferation-required gene product is reduced using a sublethal
level of a known antibiotic which acts against the proliferation-required gene product.

Interactions between drugs which affect the same biological pathway have been
described in the literature. For example, Mecillinam (Amdinocillin) binds to and
inactivates the penicillin binding protein 2 (PBP2, product of the mrdA gene in E. coli).
This antibiotic interacts with other antibiotics that inhibit PBP2 as well as antibiotics that
inhibit other penicillin binding proteins such as PBP3 [(Gutmann, L., Vincent, S.,
Billot-Klein, D., Acar, J.F., Mrena, E., and Williamson, R. (1986) Involvement of
penicillin-binding protein 2 with other penicillin-binding proteins in lysis of
Escherichia coli by some beta-lactam antibiotics alone and in synergistic lytic effect of
amdinocillin (mecillinam). Antimicrobial Agents & Chemotherapy, 30:906-912), the
disclosure of which is incorporated herein by reference in its entirety]. Interactions
between drugs could, therefore, involve two drugs that inhibit the same target protein or
nucleic acid or inhibit different proteins or nucleic acids in the same pathway
[(Fukuoka, T., Domon, H., Kakuta, M., Ishii, C., Hirasawa, A., Utsui, Y., Ohya, S., and
Yasuda, H. (1997) Combination effect between panipenem and vancomycin on highly
methicillin-resistant Staphylococcus aureus. Japan. J. Antibio. 50:411-419; Smith, C.E.,
Foleno, B.E., Barrett, J.F., and Frosco, M.B. (1997) Assessment of the synergistic
interactions of levofloxacin and ampicillin against Enterococcus faecium by the
checkerboard agar dilution and time-kill methods. Diagnos. Microbiol. Infect. Disease
27:85-92; den Hollander, J.G., Horrevorts, A.M., van Goor, M.L., Verbrugh, H.A., and
Mouton, J.W. (1997) Synergism between tobramycin and ceftazidime against a resistant
Pseudomonas aeruginosa strain, tested in an in vitro pharmacokinetic model.
Antimicrobial Agents & Chemotherapy. 41:95-110), the disclosures of all of which are
incorporated herein by reference in their entireties].

Two drugs may interact even though they inhibit different targets. For example,
the proton pump inhibitor, Omeprazole, and the antibiotic, Amoxycillin, two synergistic
compounds acting together, can cure Helicobacter pylori infection [(Gabryelewicz, A.,
Laszewicz, W., Dzieniszewski, J., Ciok, J., Marlicz, K., Bielecki, D., Popiela, T.,
Legutko, J., Knapik, Z., Poniewierka, E. (1997) Multicenter evaluation of dual-therapy
(omeprazol and amoxycillin) for Helicobacter pylori-associated duodenal and gastric
ulcer (two years of the observation). J. Physiol. Pharmacol. 48 Suppl 4:93-105), the
disclosure of which is incorporated herein by reference in its entirety].

The growth inhibition from the sublethal concentration of the known antibiotic
may be at least about 5%, at least about 8%, at least about 10%, at least about 20%, at
least about 30%, at least about 40%, at least about 50%, at least about 60%, or at least
about 75%, or more.

Alternatively, the sublethal concentration of the known antibiotic may be
determined by measuring the activity of the target proliferation-required gene product
rather than by measuring growth inhibition.

Cells are contacted with a combination of each member of a panel of known
antibiotics at a sublethal level and varying concentrations of the test antibiotic. As a
control, the cells are contacted with varying concentrations of the test antibiotic alone.
The IC50 of the test antibiotic in the presence and absence of the known antibiotic is
determined. If the IC50s in the presence and absence of the known drug are
substantially similar, then the test drug and the known drug act on different pathways.
If the IC50s are substantially different, then the test drug and the known drug act on the
same pathway.

Another embodiment of the present invention is a method for identifying a
candidate compound for use as an antibiotic in which the activity of target proteins or
nucleic acids involved in proliferation-required pathways is reduced by contacting cells
with a sublethal concentration of a known antibiotic which acts against the target
protein or nucleic acid. In one embodiment, the target protein or nucleic acid is a target
protein or nucleic acid corresponding to a proliferation-required nucleic acid identified
using the methods described above. The method is similar to those described above for
identifying candidate compounds for use as antibiotics except that rather than reducing
the activity or level of a proliferation-required gene product using a sublethal level of
antisense to a proliferation-required nucleic acid, the activity or level of the
proliferation-required gene product is reduced using a sublethal level of a known
antibiotic which acts against the proliferation-required gene product.

The growth inhibition from the sublethal concentration of the known antibiotic 
may be at least about 5%, at least about 8%, at least about 10%, at least about 20%, at 
least about 30%, at least about 40%, at least about 50%, at least about 60%, or at least 
about 75%, or more. 

Alternatively, the sublethal concentration of the known antibiotic may be
determined by measuring the activity of the target proliferation-required gene product
rather than by measuring growth inhibition.

In order to characterize test compounds of interest, cells are contacted with a
panel of known antibiotics at a sublethal level and one or more concentrations of the
test compound. As a control, the cells are contacted with the same concentrations of the
test compound alone. The IC50 of the test compound in the presence and absence of the
known antibiotic is determined. If the IC50 of the test compound is substantially
different in the presence and absence of the known drug then the test compound is a
good candidate for use as an antibiotic. As discussed above, once a candidate
compound is identified using the above methods its structure may be optimized using
standard techniques such as combinatorial chemistry.

Representative known antibiotics which may be used in each of the above 
methods are provided in the table below. However, it will be appreciated that other 
antibiotics may also be used. 



ANTIBIOTIC | INHIBITS/TARGET | RESISTANT MUTANTS

Inhibitors of Transcription
Rifamycin, 1959; Rifampicin; Rifabutin; Rifaximin | Inhibits initiation of transcription/β-subunit RNA polymerase, rpoB | rpoB, crp, cyaA
Streptolydigin | Accelerates transcription chain termination/β-subunit RNA polymerase | rpoB
Streptovaricin | An acyclic ansamycin, inhibits RNA polymerase | rpoB
Actinomycin D+EDTA | Intercalates between 2 successive G-C pairs, rpoB, inhibits RNA synthesis | pldA

Inhibitors of Nucleic Acid Metabolism
Quinolones, 1962; Nalidixic acid; Oxolinic acid | α subunit gyrase and/or topoisomerase IV, gyrA | gyrA or B, icd, sloB
Fluoroquinolones; Ciprofloxacin, 1983; Norfloxacin | α subunit gyrase, gyrA and/or topoisomerase IV (probable target in Staph) | gyrA; norA (efflux in Staph); hipQ
Coumarins; Novobiocin | Inhibits ATPase activity of B-subunit gyrase, gyrB | gyrB, cysB, cysE, nov, ompA
Coumermycin | Inhibits ATPase activity of B-subunit gyrase, gyrB | gyrB, hisW
Albicidin | DNA synthesis | tsx (nucleoside channel)
Metronidazole | Causes single-strand breaks in DNA | nar

Inhibitors of Metabolic Pathways
Sulfonamides, 1932; Sulfanilamide | Blocks synthesis of dihydrofolate, dihydropteroate synthesis, folP | folP, gpt, pabA, pabB, pabC
Trimethoprim, 1962 | Inhibits dihydrofolate reductase, folA | folA, thyA
Showdomycin | Nucleoside analogue capable of alkylating sulfhydryl groups, inhibitor of thymidylate synthetase | nupC, pnp
Thiolactomycin | Type II fatty acid synthase inhibitor | emrB; fadB, emrB due to gene dosage
Psicofuranine | Adenosine glycoside antibiotic, target is GMP synthetase | guaA,B
Triclosan | Inhibits fatty acid synthesis | fabI (envM)
Diazoborines; Isoniazid; Ethionamide | Heterocyclic, contains boron, inhibit fatty acid synthesis, enoyl-ACP reductase, fabI | fabI (envM)

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Inhibitors of Translation 

Phenylpropanoids; Chloramphenicol, 1947 | Binds to ribosomal peptidyl transfer center preventing peptide translocation/binds to S6, L3, L6, L14, L16, L25, L26, L27, but preferentially to L16 | rrn, cmlA, marA, ompF, ompR
Tetracyclines, 1948, type II polyketides; Minocycline; Doxycycline | Binding to 30S ribosomal subunit, "A" site on 30S subunit, blocks peptide elongation, strongest binding to S7 | clmA (cmr), mar, ompF
Macrolides (type I polyketides); Erythromycin, 1950; Carbomycin, Spiramycin, etc. | Binding to 50S ribosomal subunit, 23S rRNA, blocks peptide translocation, L15, L4, L12 | rrn, rplC, rplD, rplV, mac
Aminoglycosides; Streptomycin, 1944 | Irreversible binding to 30S ribosomal subunit, prevents translation or causes mistranslation of mRNA/16S rRNA | rpsL, strC,M, ubiF
Neomycin | | atpA-E, ecfB, hemA,C,D,E,G, topA
Spectinomycin | | rpsC,D,E, rrn, spcB
Kanamycin | | atpA-atpE, cpxA, ecfB, hemA,B,L, topA
Kasugamycin | | ksgA,B,C,D, rplB,K, rpsI,N,M,R
Gentamicin, 1963 | | rplF, ubiF
Amikacin | | cpxA
Paromomycin | | rpsL
Lincosamides; Lincomycin, 1955; Clindamycin | Binding to 50S ribosomal subunit, blocks peptide translocation | linB, rplN,O, rpsG
Streptogramins; Virginiamycin, 1955; Pristinamycin; Synercid: quinupristin/dalfopristin | 2 components, Streptogramins A & B, bind to the 50S ribosomal subunit blocking peptide translocation and peptide bond formation |
Fusidanes; Fusidic Acid | Inhibition of elongation factor G (EF-G), prevents peptide translocation | fusA
Kirromycin (Mocimycin) | Inhibition of elongation factor Tu (EF-Tu), prevents peptide bond formation | tufA,B
Pulvomycin | Binds to and inhibits EF-Tu |
Thiopeptin | Sulfur-containing antibiotic, inhibits protein synthesis, EF-G | rplE
Tiamulin | Inhibits protein synthesis | rplC, rplD
Negamycin | Inhibits termination process of protein synthesis | prfB
Oxazolidinones; Linezolid | 23S rRNA |
Isoniazid | | pdx
Nitrofurantoin | Inhibits protein synthesis, nitroreductases convert nitrofurantoin to highly reactive electrophilic intermediates which attack bacterial ribosomal proteins non-specifically | nfnA,B
Pseudomonic Acids; Mupirocin (Bactroban) | Inhibition of isoleucyl tRNA synthetase; used for Staph, topical cream, nasal spray | ileS
Indolmycin | Inhibits tryptophanyl-tRNA synthetase | trpS
Viomycin | | rrmA (23S rRNA methyltransferase; mutant has slow growth rate, slow chain elongation rate, and viomycin resistance)
Thiopeptides | Binds to L11-23S RNA complex |
Thiostrepton | Inhibits GTP hydrolysis by EF-G |
Micrococcin | Stimulates GTP hydrolysis by EF-G |



Inhibitors of Cell Walls/Membranes

β-lactams; Penicillin, 1929; Ampicillin; Methicillin, 1960; Cephalosporins, 1962 | Inhibition of one or more cell wall transpeptidases, endopeptidases, and glycosidases (PBPs); of the 12 PBPs only 2 are essential: mrdA (PBP2) and ftsI (pbpB, PBP3) | ampC, ampD, ampE, envZ, galU, hipA, hipQ, ompC, ompF, ompR, ptsI, rfa, tolD, tolE; tonB
Mecillinam (amdinocillin) | Binds to and inactivates PBP2 (mrdA) | alaS, argS, crp, cyaA, envB, mrdA,B, mreB,C,D
Aztreonam (Furazlocillin) | Inactivates PBP3 (ftsI) |
Bacilysin, Tetaine | Dipeptide, inhib glucosamine synthase | dppA
Glycopeptides; Vancomycin, 1955 | Inhib G+ cell wall syn, binds to terminal D-ala-D-ala of pentapeptide |
Polypeptides; Bacitracin | Prevents dephosphorylation and regeneration of lipid carrier | rfa
Cyclic lipopeptide; Daptomycin, 1980 | Disrupts multiple aspects of membrane function, including peptidoglycan synthesis, lipoteichoic acid synthesis, and the bacterial membrane potential |
Cyclic polypeptides; Polymyxin, 1939 | Surfactant action disrupts cell membrane lipids, binds lipid A moiety of LPS | pmrA
Fosfomycin, 1969 | Analogue of P-enolpyruvate, inhibits 1st step in peptidoglycan synthesis: UDP-N-acetylglucosamine enolpyruvyl transferase, murA. Also acts as immunosuppressant | murA, crp, cyaA, glpT, hipA, ptsI, uhpT
Cycloserine | Prevents formation of D-ala dimer, inhibits D-ala ligase, ddlA,B | hipA, cycA
Alafosfalin | Phosphonodipeptide, cell wall synthesis inhibitor, potentiator of β-lactams | pepA, tpp

Inhibitors of Protein Processing/Transport

Globomycin | Inhibits signal peptidase II (cleaves prolipoproteins subsequent to lipid modification), lspA | lpp, dnaE

EXAMPLE 12

Transfer of Exogenous Nucleic Acid Sequences to other Bacterial Species Using the E.
coli Expression Vectors or Expression Vectors Functional in Bacterial Species other than
E. coli

The above methods were validated using antisense nucleic acids which inhibit
the growth of E. coli which were identified using methods similar to those described
above. Expression vectors which inhibited growth of E. coli upon induction of
antisense RNA expression with IPTG were transformed directly into Enterobacter
cloacae, Klebsiella pneumoniae or Salmonella typhimurium. The transformed cells were
then assayed for growth inhibition according to the method of Example 1. After growth
in liquid culture, cells were plated at various serial dilutions and a score determined by
calculating the log difference in growth for INDUCED vs. UNINDUCED antisense
RNA expression as determined by the maximum 10 fold dilution at which a colony was
observed. The results of these experiments are listed below in Table VI. If there was
no effect of antisense RNA expression in an organism, the clone is minus in Table VI.
In contrast, a positive in Table VI means that at least 10 fold more cells were required to
observe a colony on the induced plate than on the non-induced plate under the
conditions used and in that organism.
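
The scoring rule described above can be sketched as follows. The plate readings and function names are hypothetical, and the sketch assumes that each culture is recorded as a mapping from the dilution exponent n (a 10^-n dilution) to whether a colony was observed.

```python
def max_dilution_exponent(colonies_by_dilution):
    """Return the largest n such that a colony was observed when a 10**-n
    dilution was plated. Raises if no colony was observed at any dilution."""
    observed = [n for n, seen in colonies_by_dilution.items() if seen]
    if not observed:
        raise ValueError("no colonies observed at any tested dilution")
    return max(observed)

def sensitivity_score(induced, uninduced):
    """Log10 difference in growth between UNINDUCED and INDUCED cultures,
    based on the maximum 10 fold dilution at which a colony was observed."""
    return max_dilution_exponent(uninduced) - max_dilution_exponent(induced)

def table_symbol(score):
    """'+' when at least 10 fold more cells were needed on the induced plate."""
    return "+" if score >= 1 else "-"

# Hypothetical readings for one clone in one organism.
uninduced = {1: True, 2: True, 3: True, 4: True, 5: True, 6: True, 7: False}
induced = {1: True, 2: True, 3: True, 4: False, 5: False, 6: False, 7: False}

score = sensitivity_score(induced, uninduced)
print(score, table_symbol(score))
```

In this hypothetical case the induced culture stops yielding colonies three 10 fold dilutions earlier than the uninduced culture, so the clone would be scored as positive.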

Sixteen of the constructs were found to inhibit growth in all the organisms tested
upon induction of antisense RNA expression with IPTG. Those skilled in the art will
appreciate that a negative result in a heterologous organism does not mean that that
organism is missing that gene nor does it mean that the gene is unessential. However, a
positive result means that the heterologous organism contains a homologous gene which
is required for proliferation of that organism. The homologous gene may be obtained
using the methods described herein. Those cells that are inhibited by antisense may be
used in cell based assays as described herein for the identification and characterization
of compounds in order to develop antibiotics effective in these organisms. Those
skilled in the art will appreciate that an antisense molecule which works in the organism
from which it was obtained will not always work in a heterologous organism.

TABLE VI

Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit
Proliferation in E. coli

Mol. No. | S. typhimurium | E. cloacae | K. pneumoniae
EcXA001 | + | + | -
EcXA004 | - | - | -
EcXA005 | + |  | +
EcXA006 |  |  |
EcXA007 |  | + |
EcXA008 | + |  | +
EcXA010 | + |  | +
EcXA011 | + |  |
EcXA012 |  | + |
EcXA013 | + | + | +
EcXA014 | + |  |
EcXA015 |  | + |
EcXA016 | + | + | +
EcXA017 | + | + |
EcXA018 | + |  |
EcXA019 | + | + | +
EcXA020 | + |  | +
EcXA021 | + |  | +
EcXA023 | + | + | +
EcXA024 | + |  | +
EcXA025 |  |  |
EcXA026 | + | + |
EcXA027 | + | + | +
EcXA028 |  |  |
EcXA029 |  |  |
EcXA030 |  |  | +
EcXA031 |  |  |
EcXA032 |  | + | -
EcXA033 | + |  |
EcXA034 | + |  | +
EcXA035 |  |  |
EcXA036 |  |  | +
EcXA037 |  |  |
EcXA038 |  |  |
EcXA039 |  |  |
EcXA041 |  |  |
EcXA042 | + | + |
EcXA044 |  |  |
EcXA045 |  |  |
EcXA046 |  |  |
EcXA047 |  |  |
EcXA048 |  |  |
EcXA049 | + |  |
EcXA050 |  |  |
EcXA051 | + |  |
EcXA052 | + |  |
EcXA053 |  |  | +
EcXA054 |  | + |
EcXA055 |  | - |



EXAMPLE 13

Use of Identified Exogenous Nucleic Acid Sequences as Probes
The identified sequences of the present invention can be used as probes to obtain
the sequence of additional genes of interest from a second organism. For example,
probes to potential bacterial target proteins may be hybridized to nucleic acids from
other organisms including other bacteria and higher organisms, to identify homologous
sequences. Such hybridization might indicate that the protein encoded by the gene to
which the probe corresponds is found in humans and therefore not necessarily a good
drug target. Alternatively, the gene can be conserved only in bacteria and therefore
would be a good drug target for a broad spectrum antibiotic or antimicrobial.

Probes derived from the identified nucleic acid sequences of interest or portions
thereof can be labeled with detectable labels familiar to those skilled in the art, including
radioisotopes and non-radioactive labels, to provide a detectable probe. The detectable
probe can be single stranded or double stranded and can be made using techniques known
in the art, including in vitro transcription, nick translation, or kinase reactions. A nucleic
acid sample containing a sequence capable of hybridizing to the labeled probe is contacted
with the labeled probe. If the nucleic acid in the sample is double stranded, it can be
denatured prior to contacting the probe. In some applications, the nucleic acid sample can
be immobilized on a surface such as a nitrocellulose or nylon membrane. The nucleic acid
sample can comprise nucleic acids obtained from a variety of sources, including genomic
DNA, cDNA libraries, RNA, or tissue samples.

Procedures used to detect the presence of nucleic acids capable of hybridizing to
the detectable probe include well known techniques such as Southern blotting, Northern
blotting, dot blotting, colony hybridization, and plaque hybridization. In some
applications, the nucleic acid capable of hybridizing to the labeled probe can be cloned
into vectors such as expression vectors, sequencing vectors, or in vitro transcription
vectors to facilitate the characterization and expression of the hybridizing nucleic acids in
the sample. For example, such techniques can be used to isolate, purify and clone
sequences from a genomic library, made from a variety of bacterial species, which are
capable of hybridizing to probes made from the sequences identified in Examples 5 and 6.

EXAMPLE 14

Preparation of PCR Primers and Amplification of DNA
The identified E. coli genes corresponding directly to or located within the operon
of nucleic acid sequences required for proliferation or portions thereof can be used to
prepare PCR primers for a variety of applications, including the identification or isolation
of homologous sequences from other species, for example S. typhimurium, E. cloacae, and
Klebsiella pneumoniae, which contain part or all of the homologous genes. Because
homologous genes are related but not identical in sequence, those skilled in the art will
often employ degenerate sequence PCR primers. Such degenerate sequence primers are
designed based on conserved sequence regions, either known or suspected, such as
conserved coding regions. The successful production of a PCR product using degenerate
probes generated from the sequences identified herein would indicate the presence of a
homologous gene sequence in the species being screened. The PCR primers are at least 10
bases, and preferably at least 20 bases in length. More preferably, the PCR primers are at
least 20-30 bases in length. In some embodiments, the PCR primers can be more than 30
bases in length. It is preferred that the primer pairs have approximately the same G/C
ratio, so that melting temperatures are approximately the same. A variety of PCR
techniques are familiar to those skilled in the art. For a review of PCR technology, see
Molecular Cloning to Genetic Engineering, White, B.A. Ed. in Methods in Molecular
Biology 67: Humana Press, Totowa 1997. When the entire coding sequence of the target
gene is known, the 5' and 3' regions of the target gene can be used as the sequence source
for PCR probe generation. In each of these PCR procedures, PCR primers on either side
of the nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid
sample along with dNTPs and a thermostable polymerase such as Taq polymerase, Pfu
polymerase, or Vent polymerase. The nucleic acid in the sample is denatured and the PCR
primers are specifically hybridized to complementary nucleic acid sequences in the
sample. The hybridized primers are extended. Thereafter, another cycle of denaturation,
hybridization, and extension is initiated. The cycles are repeated multiple times to produce
an amplified fragment containing the nucleic acid sequence between the primer sites.
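
The preference for primer pairs with similar G/C ratios and melting temperatures can be checked with a short sketch. The primer sequences, function names, and the 4 degree tolerance are hypothetical, and the melting temperature uses the Wallace rule (2 °C per A/T, 4 °C per G/C), which is only a rough approximation for short oligonucleotides and is not a method stated in this specification.

```python
def gc_fraction(primer):
    """Fraction of bases in the primer that are G or C."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)

def wallace_tm(primer):
    """Rough melting temperature estimate in degrees C (Wallace rule):
    2 * (A + T) + 4 * (G + C)."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc

def primers_matched(forward, reverse, max_tm_difference=4):
    """True when the estimated melting temperatures differ by no more than
    max_tm_difference degrees (an illustrative tolerance)."""
    return abs(wallace_tm(forward) - wallace_tm(reverse)) <= max_tm_difference

# Hypothetical 20-base primers, not taken from the sequences of the invention.
forward = "ATGCGTACCGTTAGCTAGGC"
reverse = "TTGACCGGATCCATGCAAGT"

for name, primer in (("forward", forward), ("reverse", reverse)):
    print(name, len(primer), f"GC={gc_fraction(primer):.2f}", f"Tm~{wallace_tm(primer)}C")
print("matched:", primers_matched(forward, reverse))
```

For longer primers, a nearest-neighbor thermodynamic model would normally give a better estimate than this rule of thumb.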

EXAMPLE 15
Inverse PCR

The technique of inverse polymerase chain reaction can be used to extend the
known nucleic acid sequence identified in Examples 5 and 6. The inverse PCR reaction is
described generally by Ochman et al., in Ch. 10 of PCR Technology: Principles and
Applications for DNA Amplification, (Henry A. Erlich, Ed.) W.H. Freeman and Co.
(1992). Traditional PCR requires two primers that are used to prime the synthesis of
complementary strands of DNA. In inverse PCR, only a core sequence need be known.

Using the sequences identified as relevant from the techniques taught in Examples
5 and 6 and applied to other species of bacteria, a subset of exogenous nucleic acid
sequences is identified that corresponds to genes or operons that are required for bacterial
proliferation. In species for which a genome sequence is not known, the technique of
inverse PCR provides a method for obtaining the gene in order to determine the sequence
or to place the probe sequences in full context to the target sequence to which the
identified exogenous nucleic acid sequence binds.

To practice this technique, the genome of the target organism is digested with an
appropriate restriction enzyme so as to create fragments of nucleic acid that contain the
identified sequence as well as unknown sequences that flank the identified sequence.
These fragments are then circularized and become the template for the PCR reaction. PCR
primers designed in accordance with the teachings of Example 14 and directed to the
ends of the identified sequence are synthesized. The primers direct nucleic acid synthesis
away from the known sequence and toward the unknown sequence contained within the
circularized template. After the PCR reaction is complete, the resulting PCR products can
be sequenced so as to extend the sequence of the identified gene past the core sequence of
the identified exogenous nucleic acid sequence. In this manner, the full
sequence of each novel gene can be identified. Additionally the sequences of adjacent
coding and noncoding regions can be identified.
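
The geometry of inverse PCR can be illustrated with a small string-based sketch. The sequences, the function name, and the 4-base primer sites are hypothetical and far shorter than real primers; the sketch only models the top strand of a restriction fragment that has been circularized by joining its two ends.

```python
def inverse_pcr_product(fragment, core, primer_len=4):
    """Model the product amplified from a circularized restriction fragment
    using outward-facing primers placed at the two ends of the known core.

    fragment: top-strand sequence of the restriction fragment (flank + core + flank)
    core:     the known sequence contained in the fragment
    """
    start = fragment.find(core)
    if start < 0:
        raise ValueError("core sequence not found in fragment")
    end = start + len(core)
    left_flank, right_flank = fragment[:start], fragment[end:]
    # Primer sites: the last bases of the core prime synthesis into the right
    # flank; the first bases of the core (opposite strand) prime into the left flank.
    forward_site = core[-primer_len:]
    reverse_site = core[:primer_len]
    # On the circle, synthesis leaves the core, runs through the right flank,
    # crosses the ligation junction, and returns through the left flank.
    return forward_site + right_flank + left_flank + reverse_site

# Hypothetical fragment: unknown flanks "TTTT" and "GGGG" around a known core.
core = "ACGTACGGCA"
fragment = "TTTT" + core + "GGGG"
product = inverse_pcr_product(fragment, core)
print(product)
```

Sequencing such a product reads outward from the known core into both flanks, which is how the previously unknown flanking sequence is recovered.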

EXAMPLE 16

Identification of Genes Required for Staphylococcus aureus Proliferation
Genes required for proliferation in Staphylococcus aureus are identified according
to the methods described above.

EXAMPLE 17

Identification of Genes Required for Neisseria gonorrhoeae Proliferation
Genes required for proliferation in Neisseria gonorrhoeae are identified according
to the methods described above.

EXAMPLE 18

Identification of Genes Required for Pseudomonas aeruginosa Proliferation
Genes required for proliferation in Pseudomonas aeruginosa are identified
according to the methods described above.

EXAMPLE 19

Identification of Genes Required for Enterococcus faecalis Proliferation
Genes required for proliferation in Enterococcus faecalis are identified according
to the methods described above.

EXAMPLE 20

Identification of Genes Required for Haemophilus influenzae Proliferation
Genes required for proliferation in Haemophilus influenzae are identified
according to the methods described above.

EXAMPLE 21

Identification of Genes Required for Salmonella typhimurium Proliferation
Genes required for proliferation in Salmonella typhimurium are identified
according to the methods described above.

EXAMPLE 22

Identification of Genes Required for Helicobacter pylori Proliferation
Genes required for proliferation in Helicobacter pylori are identified according to
the methods described above.

EXAMPLE 23

Identification of Genes Required for Mycoplasma pneumoniae Proliferation
Genes required for proliferation in Mycoplasma pneumoniae are identified
according to the methods described above.

EXAMPLE 24

Identification of Genes Required for Plasmodium ovale Proliferation
Genes required for proliferation in Plasmodium ovale are identified according to
the methods described above.

EXAMPLE 25

Identification of Genes Required for Saccharomyces cerevisiae Proliferation
Genes required for proliferation in Saccharomyces cerevisiae are identified
according to the methods described above.

EXAMPLE 26

Identification of Genes Required for Entamoeba histolytica Proliferation
Genes required for proliferation in Entamoeba histolytica are identified according
to the methods described above.

EXAMPLE 27

Identification of Genes Required for Candida albicans Proliferation
Genes required for proliferation in Candida albicans are identified according to
the methods described above.

EXAMPLE 28

Identification of Genes Required for Klebsiella pneumoniae Proliferation
Genes required for proliferation in Klebsiella pneumoniae are identified according
to the methods described above.

EXAMPLE 29

Identification of Genes Required for Salmonella typhi Proliferation
Genes required for proliferation in Salmonella typhi are identified according to the
methods described above.

EXAMPLE 30

Identification of Genes Required for Salmonella paratyphi Proliferation
Genes required for proliferation in Salmonella paratyphi are identified according
to the methods described above.

EXAMPLE 31

Identification of Genes Required for Salmonella choleraesuis Proliferation
Genes required for proliferation in Salmonella choleraesuis are identified according
to the methods described above.

EXAMPLE 32

Identification of Genes Required for Staphylococcus epidermidis Proliferation
Genes required for proliferation in Staphylococcus epidermidis are identified
according to the methods described above.

EXAMPLE 33

Identification of Genes Required for Mycobacterium tuberculosis Proliferation
Genes required for proliferation in Mycobacterium tuberculosis are identified
according to the methods described above.

EXAMPLE 34

Identification of Genes Required for Mycobacterium leprae Proliferation
Genes required for proliferation in Mycobacterium leprae are identified according
to the methods described above.

EXAMPLE 35

Identification of Genes Required for Treponema pallidum Proliferation
Genes required for proliferation in Treponema pallidum are identified according to
the methods described above.

EXAMPLE 36

Identification of Genes Required for Bacillus anthracis Proliferation
Genes required for proliferation in Bacillus anthracis are identified according to
the methods described above.

EXAMPLE 37

Identification of Genes Required for Yersinia pestis Proliferation
Genes required for proliferation in Yersinia pestis are identified according to the
methods described above.

EXAMPLE 38

Identification of Genes Required for Clostridium botulinum Proliferation
Genes required for proliferation in Clostridium botulinum are identified according
to the methods described above.

EXAMPLE 39

Identification of Genes Required for Campylobacter jejuni Proliferation
Genes required for proliferation in Campylobacter jejuni are identified according
to the methods described above.

EXAMPLE 40

Identification of Genes Required for Chlamydia trachomatis Proliferation
Genes required for proliferation in Chlamydia trachomatis are identified
according to the methods described above.

Use of Isolated Exogenous Nucleic Acid Fragments as Antisense Antibiotics

In addition to using the identified sequences to enable screening of molecule
libraries to identify compounds useful as antibiotics, the sequences themselves can
be used as therapeutic agents. Specifically, the identified exogenous sequences in an
antisense orientation can be provided to an individual to inhibit the translation of a
bacterial target gene.

Generation of Antisense Therapeutics from Identified Exogenous Sequences

The sequences of the present invention can be used as antisense therapeutics for
the treatment of bacterial infections or simply for inhibition of bacterial growth in vitro
or in vivo. The therapy exploits the biological process in cells where genes are
transcribed into messenger RNA (mRNA) that is then translated into proteins.
Antisense RNA technology contemplates the use of antisense oligonucleotides directed
against a target gene that will bind to its target and decrease or inhibit the translation of
the target mRNA. In one embodiment, antisense oligonucleotides can be used to treat
and control a bacterial infection of a cell culture containing a population of desired cells
contaminated with bacteria. In another embodiment, the antisense oligonucleotides can
be used to treat an organism with a bacterial infection.

Antisense oligonucleotides can be synthesized from any of the sequences of the
present invention using methods well known in the art. In a preferred embodiment,
antisense oligonucleotides are synthesized using artificial means. Uhlmann &
Peyman, Chemical Rev. 90:543-584 (1990) review antisense oligonucleotide
technology in detail. Modified or unmodified antisense oligonucleotides can be used as
therapeutic agents. Modified antisense oligonucleotides are preferred since it is well
known that unmodified antisense oligonucleotides are extremely unstable. Modification
of the phosphate backbones of the antisense oligonucleotides can be achieved by substituting
the internucleotide phosphate residues with methylphosphonates, phosphorothioates,
phosphoramidates, and phosphate esters. Nonphosphate internucleotide analogs such as
siloxane bridges, carbonate bridges, thioester bridges, as well as many others known in
the art, can also be used. The preparation of certain antisense oligonucleotides with modified
internucleotide linkages is described in U.S. Patent No. 5,142,047, hereby incorporated
by reference.

Modifications to the nucleoside units of the antisense oligonucleotides are also
contemplated. These modifications can increase the half-life and increase cellular rates
of uptake for the oligonucleotides in vivo. For example, α-anomeric nucleotide units
and modified bases such as 1,2-dideoxy-D-ribofuranose, 1,2-dideoxy-1-
phenylribofuranose, and N4,N4-ethano-5-methyl-cytosine are contemplated for use in
the present invention.

An additional form of modified antisense molecules is found in peptide nucleic
acids. Peptide nucleic acids (PNA) have been developed to hybridize to single and double
stranded nucleic acids. PNA are nucleic acid analogs in which the entire deoxyribose-
phosphate backbone has been exchanged with a chemically completely different, but
structurally homologous, polyamide (peptide) backbone containing 2-aminoethyl glycine
units. Unlike DNA, which is highly negatively charged, the PNA backbone is neutral.
Therefore, there is much less repulsive energy between complementary strands in a PNA-
DNA hybrid than in the comparable DNA-DNA hybrid, and consequently they are much
more stable. PNA can hybridize to DNA in either a Watson/Crick or Hoogsteen fashion
(Demidov et al., Proc. Natl. Acad. Sci. U.S.A. 92:2637-2641, 1995; Egholm, Nature
365:566-568, 1993; Nielsen et al., Science 254:1497-1500, 1991; Dueholm et al., New J.
Chem. 21:19-31, 1997).

Molecules called PNA "clamps" have been synthesized which have two identical
PNA sequences joined by a flexible hairpin linker containing three 8-amino-3,6-
dioxaoctanoic acid units. When a PNA clamp is mixed with a complementary homopurine
or homopyrimidine DNA target sequence, a PNA-DNA-PNA triplex hybrid can form
which has been shown to be extremely stable (Bentin et al., Biochemistry 35:8863-8869,
1996; Egholm et al., Nucleic Acids Res. 23:217-222, 1995; Griffith et al., J. Am. Chem.
Soc., 117:831-832, 1995).

The sequence-specific and high affinity duplex and triplex binding of PNA have
been extensively described (Nielsen et al., Science 254:1497-1500, 1991; Egholm et al., J.
Am. Chem. Soc., 114:9677-9678, 1992; Egholm et al., Nature 365:566-568, 1993;
Almarsson et al., Proc. Natl. Acad. Sci. U.S.A. 90:9542-9546, 1993; Demidov et al., Proc.
Natl. Acad. Sci. U.S.A. 92:2637-2641, 1995). They have also been shown to be resistant to
nuclease and protease digestion (Demidov et al., Biochem. Pharm. 48:1010-1313, 1994).
PNA has been used to inhibit gene expression (Hanvey et al., Science 258:1481-
1485, 1992; Nielsen et al., Nucl. Acids Res., 21:197-200, 1993; Nielsen et al., Gene
149:139-145, 1994; Good & Nielsen, Science, 95:2073-2076, 1998; all of which are
hereby incorporated by reference), to block restriction enzyme activity (Nielsen et al.,
supra., 1993), to act as an artificial transcription promoter (Mollegaard, Proc. Natl. Acad.
Sci. U.S.A. 91:3892-3895, 1994) and as a pseudo restriction endonuclease (Demidov et al.,
Nucl. Acids Res. 21:2103-2107, 1993). Recently, PNA has also been shown to have
antiviral and antitumoral activity mediated through an antisense mechanism (Norton,
Nature Biotechnol., 14:615-619, 1996; Hirschman et al., J. Investig. Med. 44:347-351,
1996). PNAs have been linked to various peptides in order to promote PNA entry into
cells (Basu et al., Bioconj. Chem. 8:481-488, 1997; Pardridge et al., Proc. Natl. Acad. Sci.
U.S.A. 92:5592-5596, 1995).

The antisense oligonucleotides contemplated by the present invention can be
administered by direct application of oligonucleotides to a target using standard
techniques well known in the art. The antisense oligonucleotides can be generated
within the target using a plasmid, or a phage. Alternatively, the antisense nucleic acid
may be expressed from a sequence in the chromosome of the target cell. It is further
contemplated that the antisense oligonucleotides are
incorporated in a ribozyme sequence to enable the antisense to specifically bind and cleave
its target mRNA. For technical applications of ribozyme and antisense oligonucleotides
see Rossi et al., Pharmacol. Ther. 50(2):245-254, (1991), which is hereby incorporated
by reference. The present invention also contemplates using a retron to introduce an
antisense oligonucleotide to a cell. Retron technology is exemplified by U.S. Patent
No. 5,405,775, which is hereby incorporated by reference. Antisense oligonucleotides
can also be delivered using liposomes or by electroporation techniques which are well
known in the art.

The antisense nucleic acids of the present invention can also be used to design
antibiotic compounds comprising nucleic acids which function by intracellular triple helix
formation. Triple helix oligonucleotides are used to inhibit transcription from a genome.
The sequences identified as required for proliferation in the present invention, or portions
thereof, can be used as templates to inhibit microorganism gene expression in individuals
infected with such organisms. Traditionally, homopurine sequences were considered the
most useful for triple helix strategies. However, homopyrimidine sequences can also
inhibit gene expression. Such homopyrimidine oligonucleotides bind to the major groove
at homopurine:homopyrimidine sequences. Thus, both types of sequences based on the
sequences of the present invention that are required for proliferation are contemplated for
use as antibiotic compound templates.
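
Candidate triple helix target sites can be located by scanning a sequence for homopurine runs, as in the following sketch. The example sequence, the function name, and the minimum tract length of 10 bases are assumptions chosen for illustration; the specification does not state a minimum length.

```python
import re

def homopurine_tracts(sequence, min_length=10):
    """Return (start_index, tract) for every run of purines (A or G) on the
    given strand that is at least min_length bases long."""
    pattern = re.compile(r"[AG]{%d,}" % min_length)
    return [(m.start(), m.group()) for m in pattern.finditer(sequence.upper())]

# Hypothetical sequence containing one 12-base homopurine run and one short run.
sequence = "CCTT" + "AGGAAGGAGAAG" + "TCC" + "AGAG"
tracts = homopurine_tracts(sequence)
print(tracts)
```

A homopurine run on one strand is a homopyrimidine run on the complementary strand, so scanning one strand for both purine and pyrimidine runs would cover both orientations.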

The antisense oligonucleotides of this example employ the identified sequences
of the present invention to induce bacterial cell death or at least bacterial stasis by
inhibiting target gene translation. Antisense oligonucleotides containing from about 8
to 40 bases of the sequences of the present invention have sufficient complementarity to
form a duplex with the target sequence under physiological conditions.
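
Deriving candidate antisense oligonucleotides from a target sequence amounts to taking the reverse complement of windows of the chosen length. In the sketch below, the target sequence, the function name, and the sliding-window approach are illustrative assumptions; only the 8 to 40 base length range comes from the text above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def antisense_oligos(target, length=20, step=1):
    """Return the DNA antisense oligonucleotide (reverse complement) for each
    window of the given length along the sense-strand target sequence."""
    if not 8 <= length <= 40:
        raise ValueError("length should be between 8 and 40 bases")
    target = target.upper()
    return [
        target[i:i + length].translate(COMPLEMENT)[::-1]
        for i in range(0, len(target) - length + 1, step)
    ]

# Hypothetical 10-base sense-strand sequence, not taken from the invention.
target = "ATGGCTAAGC"
oligos = antisense_oligos(target, length=8)
print(oligos)
```

Each returned oligonucleotide is written 5' to 3' and is complementary to the corresponding window of the target.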

To kill bacterial cells or inhibit their growth, the antisense oligonucleotides are
applied to the bacteria or to the target cells under conditions that facilitate their uptake.
These conditions include sufficient incubation times of cells and oligonucleotides so
that the antisense oligonucleotides are taken up by the cells. In one embodiment, an
incubation period of 7-10 days is sufficient to kill bacteria in a sample. An optimum
concentration of antisense oligonucleotides is selected for use.

The concentration of antisense oligonucleotides to be used can vary depending
on the type of bacteria sought to be controlled, the nature of the antisense
oligonucleotide to be used, and the relative toxicity of the antisense oligonucleotide to
the desired cells in the treated culture. Antisense oligonucleotides can be introduced to
cell samples at a number of different concentrations, preferably between 1x10^-10 M and
1x10^-4 M. Once the minimum concentration that can adequately control gene expression is
identified, the optimized dose is translated into a dosage suitable for use in vivo. For
example, an inhibiting concentration in culture of 1x10^-7 M translates into a dose of
approximately 0.6 mg/kg body weight. Levels of oligonucleotide approaching 100 mg/kg
body weight or higher may be possible after testing the toxicity of the oligonucleotide in
laboratory animals. It is additionally contemplated that cells from the subject are removed,
treated with the antisense oligonucleotide, and reintroduced into the subject. This range is
merely illustrative and one of skill in the art is able to determine the optimal
concentration to be used in a given case.

After the bacterial cells have been killed or controlled in a desired culture, the
desired cell population may be used for other purposes.

EXAMPLE 41 

The following example demonstrates the ability of an E. coli antisense
oligonucleotide to act as a bactericidal or bacteriostatic agent to treat a contaminated
cell culture system. The application of the antisense oligonucleotides of the present
invention is thought to inhibit the translation of bacterial gene products required for
proliferation.

The antisense oligonucleotide of this example corresponds to a 30 base
phosphorothioate-modified oligodeoxynucleotide complementary to a nucleic acid
involved in proliferation, such as Molecule Number EcXA001. A sense
oligodeoxynucleotide complementary to the antisense sequence is synthesized and used
as a control. The oligonucleotides are synthesized and purified according to the
procedures of Matsukura, et al., Gene 72:343 (1988). The test oligonucleotides are
dissolved in a small volume of autoclaved water and added to culture medium to make a
100 micromolar stock solution.

Human bone marrow cells are obtained from the peripheral blood of two 
patients and cultured according to standard procedures well known in the art. The culture
is contaminated with the K-12 strain of E. coli and incubated at 37°C overnight to 
establish bacterial infection. 

The control and antisense oligonucleotide containing solutions are added to the
contaminated cultures and monitored for bacterial growth. After a 10 hour incubation
of culture and oligonucleotides, samples from the control and experimental cultures are
drawn and analyzed for the translation of the target bacterial gene using standard
microbiological techniques well known in the art. The target E. coli gene is found to be
translated in the control culture treated with the control oligonucleotide; however,
translation of the target gene in the experimental culture treated with the antisense
oligonucleotide of the present invention is not detected or is reduced.

EXAMPLE 42

A subject suffering from an E. coli infection is treated with the antisense
oligonucleotide preparation of Example 39. The antisense oligonucleotide is provided
in a pharmaceutically acceptable carrier at a concentration effective to inhibit the
translation of the target gene. The present subject is treated with a concentration of
antisense oligonucleotide sufficient to achieve a blood concentration of about 100
micromolar. The patient receives daily injections of antisense oligonucleotide to
maintain this concentration for a period of 1 week. At the end of the week a blood
sample is drawn and analyzed for the presence or absence of E. coli using standard
techniques well known in the art. There is no detectable evidence of E. coli and the
treatment is terminated.

EXAMPLE 43 

Preparation and use of Triple Helix Probes 
The sequences of microorganism genes required for proliferation of the present
invention are scanned to identify 10-mer to 20-mer homopyrimidine or homopurine
stretches that could be used in triple-helix based strategies for inhibiting gene expression.
Following identification of candidate homopyrimidine or homopurine stretches, their
efficiency in inhibiting gene expression is assessed by introducing varying amounts of
oligonucleotides containing the candidate sequences into a population of bacterial cells
that normally express the target gene. The oligonucleotides may be prepared on an
oligonucleotide synthesizer or they may be purchased commercially from a company
specializing in custom oligonucleotide synthesis, such as GENSET, Paris, France.
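
By way of illustration only, the scanning step may be sketched as a search for runs of purines (A, G) or pyrimidines (C, T) of at least 10 bases. The function name, the example sequence, and the choice to truncate runs longer than 20 bases to their first 20 bases are assumptions of this sketch and are not prescribed by the present disclosure.

```python
import re


def find_stretches(seq, min_len=10, max_len=20):
    """Return sorted (start, kind, stretch) tuples for candidate stretches.

    A stretch is a maximal run of at least min_len purines or pyrimidines.
    Runs longer than max_len are truncated to their first max_len bases,
    which is an assumption of this sketch.
    """
    seq = seq.upper()
    classes = {"homopurine": "AG", "homopyrimidine": "CT"}
    hits = []
    for kind, bases in classes.items():
        pattern = re.compile("[%s]{%d,}" % (bases, min_len))
        for match in pattern.finditer(seq):
            hits.append((match.start(), kind, match.group()[:max_len]))
    return sorted(hits)


# Hypothetical sequence containing one purine run and one pyrimidine run.
example_seq = "ACGT" + "AGGAAGGAGAAG" + "TCA" + "TCTTCCTCTTCCT" + "AGC"
example_hits = find_stretches(example_seq)
```

Each reported start position is a zero-based offset into the scanned sequence, so a candidate stretch can be traced back to its location in the gene.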

The oligonucleotides can be introduced into the cells using a variety of methods
known to those skilled in the art, including but not limited to calcium phosphate
precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection or native
uptake.

Treated cells are monitored for a reduction in proliferation using techniques such
as monitoring growth levels as compared to untreated cells using optical density
measurements. The oligonucleotides that are effective in inhibiting gene expression in
cultured cells can then be introduced in vivo using the techniques well known in the art at
a dosage level shown to be effective.
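
By way of illustration only, the comparison of treated and untreated cells by optical density may be expressed as a percent reduction in growth. The function name and the example readings are hypothetical, and no particular wavelength or threshold for effectiveness is implied.

```python
def percent_growth_reduction(od_treated, od_untreated):
    """Percent reduction in optical density relative to untreated cells."""
    if od_untreated <= 0:
        raise ValueError("untreated optical density must be positive")
    return 100.0 * (1.0 - od_treated / od_untreated)


# Hypothetical readings, for illustration only.
example_reduction = percent_growth_reduction(od_treated=0.25, od_untreated=1.0)
```

Computing this value for each amount of oligonucleotide tested gives a simple basis for ranking candidate sequences.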

In some embodiments, the natural (beta) anomers of the oligonucleotide units can
be replaced with alpha anomers to render the oligonucleotide more resistant to nucleases.
Further, an intercalating agent such as ethidium bromide, or the like, can be attached to the
3' end of the alpha oligonucleotide to stabilize the triple helix. For information on the
generation of oligonucleotides suitable for triple helix formation see Griffin et al. (Science
245:967-971 (1989), which is hereby incorporated by this reference).

EXAMPLE 44 

Identification of Bacterial Strains from Isolated Specimens by PCR
Classical bacteriological methods for the detection of various bacterial species
are time consuming and costly. These methods include growing the bacteria isolated
from a subject in specialized media, cultivation on selective agar media, followed by a
set of confirmation assays that can take from 8 to 10 days or longer to complete. Use of
the identified sequences of the present invention provides a method to dramatically
reduce the time necessary to detect and identify specific bacterial species present in a
sample.

In one exemplary method, bacteria are grown in enriched media and DNA samples
are isolated from specimens of, for example, blood, urine, stool, saliva or central nervous
system fluid by conventional methods. A panel of PCR primers based on identified
sequences unique to various species of microorganisms is then utilized in accordance
with Example 12 to amplify DNA of approximately 100-200 bases in length from the
specimen. A separate PCR reaction is set up for each pair of PCR primers and after the
PCR reaction is complete, the reaction mixtures are assayed for the presence of PCR
product. The presence or absence of bacteria from the species to which the PCR primer
pairs belong is determined by the presence or absence of a PCR product in the various test
PCR reaction tubes.
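
By way of illustration only, the interpretation of the panel may be sketched as a mapping from each primer pair to a presence or absence call for its species. The function name, the primer-pair labels, the example product lengths, and the use of a strict 100 to 200 base window are assumptions of this sketch.

```python
def call_species(panel, product_lengths, min_len=100, max_len=200):
    """Return {species: True/False} from per-reaction PCR results.

    panel            maps a primer-pair label to the species it targets.
    product_lengths  maps a primer-pair label to the observed product
                     length in bases, or None when no product is seen.
    The strict length window is an assumption of this sketch.
    """
    calls = {}
    for pair, species in panel.items():
        length = product_lengths.get(pair)
        calls[species] = length is not None and min_len <= length <= max_len
    return calls


# Hypothetical panel and results, for illustration only.
example_panel = {"pair_A": "Escherichia coli", "pair_B": "Staphylococcus aureus"}
example_lengths = {"pair_A": 150, "pair_B": None}
example_calls = call_species(example_panel, example_lengths)
```

A reaction with no entry in the results is treated as showing no product, so a species is reported as present only when its own reaction tube yields a product of the expected size.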

Although the PCR reaction is used to assay the isolated sample for the presence of
various bacterial species, other assays such as the Southern blot hybridization are also
contemplated.

All documents cited herein are incorporated herein by reference in their entireties. 
